GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:         September 29, 2004, 18:48:38 ; Search time 23.4674 Seconds
                (without alignments)
                791.964 Million cell updates/sec

Title:          SEQ2_1-171_975-1163
Perfect score:  1823
Sequence:       1 MEDIDQSSLVSSSTDSPPRP...VKDAMAKIQAKIPGLKRKAD 360

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.1

Searched:       389414 seqs, 51625971 residues

Total number of hits satisfying chosen parameters: 389414

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
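The throughput figure in the header can be cross-checked from the other header values. The sketch below assumes that one "cell update" is one (query residue, database residue) pair of the dynamic-programming matrix; the report does not state this definition, and the function name is a hypothetical helper.

```python
# Values copied from the report header.
QUERY_LENGTH = 360          # query residues (positions 1..360)
DB_RESIDUES = 51_625_971    # "Searched: 389414 seqs, 51625971 residues"
SEARCH_SECONDS = 23.4674    # "Search time 23.4674 Seconds"


def cell_updates_per_second(query_len, db_residues, seconds):
    """Assumed definition: one cell per (query residue, database residue) pair."""
    return query_len * db_residues / seconds


rate = cell_updates_per_second(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
print(f"{rate / 1e6:.2f} million cell updates/sec")
```

Under that assumption the result agrees with the reported 791.964 million cell updates/sec to within rounding.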

Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/2/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/2/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 

5 : /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep: *

6 : /cgn2_6/ptodata/2/iaa/backfiles1.pep: *

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
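Since a smaller Pred. No. means fewer hits expected by chance at that score, the per-hit summary lines are a natural thing to extract and filter on. The following is a minimal sketch, assuming the two summary lines have the layout seen in this report; `parse_hit` and both regular expressions are hypothetical helpers, not part of GenCore.

```python
import re

# Layout assumed from this report's per-hit summary lines.
HIT_PATTERN = re.compile(
    r"Query Match\s+(?P<query_match>\d+(?:\.\d+)?)%;\s*"
    r"Score\s+(?P<score>\d+(?:\.\d+)?);\s*"
    r"DB\s+(?P<db>\d+);\s*"
    r"Length\s+(?P<length>\d+);"
)
PRED_PATTERN = re.compile(
    r"Pred\. No\.\s+(?P<pred_no>\d+(?:\.\d+)?(?:[eE][+-]?\d+)?);"
)


def parse_hit(summary_text):
    """Extract the numeric fields of one hit summary into a dict."""
    hit = HIT_PATTERN.search(summary_text)
    pred = PRED_PATTERN.search(summary_text)
    if hit is None or pred is None:
        raise ValueError("summary lines not recognised")
    return {
        "query_match": float(hit["query_match"]),
        "score": float(hit["score"]),
        "db": int(hit["db"]),
        "length": int(hit["length"]),
        "pred_no": float(pred["pred_no"]),
    }


# Summary lines of the second hit in this report.
sample = (
    "Query Match 45.3%; Score 826.3; DB 2; Length 776;\n"
    "Best Local Similarity 27.7%; Pred. No. 4.6e-29;"
)
hit = parse_hit(sample)
print(hit)
```

A threshold such as `hit["pred_no"] < 1e-5` could then be used to keep only hits unlikely to arise by chance; the cutoff itself is an arbitrary illustration.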

RESULT 1
US-08-700-607-1 

Sequence 1, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice
APPLICANT: Goli, Surya K.
APPLICANT: Hillman, Jennifer L.

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 



COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607
FILING DATE: Filed Herewith
ATTORNEY/AGENT INFORMATION:
NAME: Billings, Lucy J.
REGISTRATION NUMBER: 36,749
REFERENCE/DOCKET NUMBER: PF-0114 US
TELECOMMUNICATION INFORMATION:
TELEPHONE: 415-855-0555
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 199 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: 

CLONE: Consensus 
US-08-700-607-1 

Query Match 49.6%; Score 904; DB 2; Length 199; 

Best Local Similarity 97.3%; Pred. No. 1e-33;

Matches 183; Conservative 3; Mismatches 2; Indels 0; Gaps 0; 

Qy        173 VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 232
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         12 VVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 71

Qy        233 QKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVL 292
              |||||||||||||||||||||||||||||||||||| |||||||||||||||||||||||
Db         72 QKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVL 131

Qy        293 MWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI 352
              ||||||||||||||||||||||||||:|||||||| |||||||||||:||||||||||||
Db        132 MWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKI 191

Qy        353 PGLKRKAD 360
              |||||||:
Db        192 PGLKRKAE 199
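The two percentages in each hit summary can be reproduced from the other reported numbers. The sketch below assumes that Query Match is the hit score divided by the header's perfect score, and that Best Local Similarity is the number of matches divided by the number of aligned columns (matches, conservative substitutions, mismatches and indels). Neither formula is stated in the report; both are inferred from its figures, and the function names are hypothetical.

```python
PERFECT_SCORE = 1823  # "Perfect score" from the report header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Assumed formula: hit score as a percentage of the perfect score."""
    return 100.0 * score / perfect_score


def best_local_similarity_percent(matches, conservative, mismatches, indels):
    """Assumed formula: identical columns as a percentage of all aligned columns."""
    aligned_columns = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned_columns


# Result 1: Score 904; Matches 183; Conservative 3; Mismatches 2; Indels 0
print(round(query_match_percent(904), 1))                      # 49.6
print(round(best_local_similarity_percent(183, 3, 2, 0), 1))   # 97.3
```

Both values agree with the "Query Match 49.6%" and "Best Local Similarity 97.3%" printed for Result 1.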



RESULT 2 
US-08-700-607-5 

Sequence 5, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 



TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/700,607
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 776 amino acids
TYPE: amino acid
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307307 
US-08-700-607-5 

Query Match 45.3%; Score 826.3; DB 2; Length 776; 

Best Local Similarity 27.7%; Pred. No. 4.6e-29; 

Matches 195; Conservative 52; Mismatches 91; Indels 367; Gaps 17; 

Qy 10 VSSSTDS PPR 19 

II I : I II: 
Db 85 VSSAMDHTFSTTSKDGEGSCYTSLISDICYPPQEDSTYFTGILQKENGHVTISESPEELG 144 

Qy 20 -PPP AFKYQFVTEPED 34

II I : I I : I II : 

Db 145 TPGPSLPDVPGIESRGLFSSDSGIEMT PAESTEVNKILADPLDQMKAEAYKYIDITRPEE 204 

Qy 35 EEDEEEEEDEEED DED 50 

: : I : III II, 
Db 205 VKHQEQHHPELEDKDLDFKNKDTDISIKPEGVREPDKPAPVEGKIIKDHLLEESTFAPYI 264 

Qy 51 LEELE 55 

I I : I 

Db 265 DDLSEEQRRAPQITTPVKITLTEIEPSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPE 324



Qy 56 VLERKPAAGLS 66

: I I II I 

Db 325 DDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRP 384 

Qy 67 AAAVPPAAAAPLLDFSSDSV PPA 89

I : I I : I I I : I II : 

Db 385 EVKARS GP PT IPS P- LDHEAS SAE S GDS E I ELVS EDPMAAEDALP S G YVS FGHVGGP P P S 443 

Qy 90 PRGPL PAAPPA 100 

II III 
Db 444 PASPSIQYSILREEREAELDSELIIESCDASSASEESPKREQDSPPMKPSALDAIREETG 503 

Qy 101 — APERQPS WERSPAAPAPSLPPA AAVLPSKLPEDD E 135 

I II I I : : I I I I I : I I I I I : I : 

Db 504 VRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGALEPETPMLPRK-PEEDSSSNQ 562 

Qy 136 PPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSWDLLYWRDIKKTGWFGASLFL 195 

II III 1 I I I I : : : I I I I I I I I I : I I : I II : I I 

Db 563 SPAATKGPGPLG PGAPPPLLFLNKQK AI DLLYWRDI KQTGI VFGS FLLL 611 

Qy        196 LLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEEL 255
              | ||| ||:||| ||:||| || |||||||| |:||:||:||||||:|||| |: :|:|
Db        612 LFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQ 671

Qy        256 VQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALIS 315
              :|||::    :||||:||||||||| |||||||||||||: |||||||||||||::|::|
Db        672 IQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAVVS 731

Qy        316 LFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
              :|::||:| :|| ||| ||||    :   :||||||||| || |:
Db        732 MFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQAKIPGAKRHAE 776



RESULT 3 
US-08-700-607-6 

Sequence 6, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K.
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc.
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607



FILING DATE: Filed Herewith 
; ATTORNEY/AGENT INFORMATION:
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 
; TELEFAX: 415-845-4166 

; INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 356 amino acids 

TYPE: amino acid

STRANDEDNESS: single 
; TOPOLOGY: linear 

; MOLECULE TYPE: peptide 
; IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 307309 
US-08-700-607-6 

Query Match 43.3%; Score 790; DB 2; Length 356; 

Best Local Similarity 44.1%; Pred. No. 4.6e-28; 

Matches 177; Conservative 47; Mismatches 87; Indels 90; Gaps 15; 

Qy 2 EDIDQSSLVS-SSTDSPPRPP— PAFKYQFVTEPEDEE-DEE EEEDEE 45 

II III | | | | : : | : | : | | | : 

Db 4 EDALPSGYVSFGHVGGPPPSPASPSIQYSILREEREAELDSELIIESCDASSASEESPKR 63 

Qy 46 EDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSV PPAPRGPL 94 

II 11:1 I : : I : I I 

Db 64 EQDS PPMKPSAL DAI RE ET GVRAE E RAP S RRG — 95 

Qy 95 PAAPPAA PERQPSWERSPAAPAPSLPPA AAVLPSKLPEDD EPPAR 139 

II: I : I I I I I I : II I I I : I : I I 

Db 96 -LAEPGSFLDYPSTEPQ PGPELPPGDGALEPETPMLPRK-PEEDSSSNQSPAA 146 

Qy 140 PPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSWDLLYWRDIKKTGWFGASLFLLLSL 199 

III I I I I I : : : I I I I I I I I I : I I : I I I : I I I I I 

Db 147 TKGPGPLG PGAPPPLLFLNKQK AIDLLYWRDIKQTGIVFGSFLLLLFSL 195 

Qy        200 TVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKY 259
              | ||:||| ||:||| || |||||||| |:||:||:||||||:|||| |: :|:| :|||
Db        196 TQFSVVSVVAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKY 255

Qy        260 SNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSI 319
              ::    :||||:||||||||| |||||||||||||: |||||||||||||::|::|:|::
Db        256 TDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAVVSMFTL 315

Qy        320 PVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
              ||:| :|| ||| ||||    :   :||||||||| || |:
Db        316 PVVYVKHQAQIDQYLGLVRTHINAVVAKIQAKIPGAKRHAE 356



RESULT 4 
US-08-700-607-7 

; Sequence 7, Application US/08700607
; Patent No. 5858708 



; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga

APPLICANT: Au-Young, Janice

APPLICANT: Goli, Surya K. 
; APPLICANT: Hillman, Jennifer L. 

; TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 

NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc.
; STREET: 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 

COUNTRY: U.S. 

ZIP: 94304
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/700,607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 
; REFERENCE/DOCKET NUMBER: PF-0114 US 

; TELECOMMUNICATION INFORMATION: 

; TELEPHONE: 415-855-0555 

; TELEFAX: 415-845-4166

; INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 208 amino acids 
; TYPE: amino acid 

STRANDEDNESS: single

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
; LIBRARY: GenBank 

; CLONE: 307311 

US-08-700-607-7 

Query Match 37.5%; Score 683; DB 2; Length 208; 

Best Local Similarity 67.4%; Pred. No. 1.2e-23; 

Matches 128; Conservative 31; Mismatches 31; Indels 0; Gaps 0;

Qy        171 SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 230
              |  :|||||||||:||:|||: | || ||| ||:||| ||:||| || |||||||| |:|
Db         19 SQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQ 78

Qy        231 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFA 290
              |:||:||||||:|||| |: :|:| :|||::    :||||:||||||||| |||||||||
Db         79 AVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFA 138

Qy        291 VLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQA 350
              ||||: |||||||||||||::|::|:|::||:| :|| ||| ||||    :   :|||||
Db        139 VLMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQA 198

Qy        351 KIPGLKRKAD 360
              |||| || |:
Db        199 KIPGAKRHAE 208



RESULT 5 
US-08-700-607-8 

Sequence 8, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION:

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166
INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 267 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 281046 
US-08-700-607-8 

Query Match 36.7%; Score 669; DB 2; Length 267; 

Best Local Similarity 66.7%; Pred. No. 8.1e-23; 

Matches 124; Conservative 32; Mismatches 30; Indels 0; Gaps 0;

Qy        171 SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 230
              |  :|||||||||:||:|||: | || ||| ||:||| ||:||| || |||||||| |:|
Db         10 SQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQ 69

Qy        231 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFA 290
              |:||:||||||:|||| |: :|:| :|||::    :||||:||||||||| |||||||||
Db         70 AVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKFA 129

Qy        291 VLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQA 350
              ||||: |||||||||||||::|::|:|::||:| :|| |:| ||||    :   :|||||
Db        130 VLMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYVKHQAQVDQYLGLVRTHINTVVAKIQA 189

Qy        351 KIPGLK 356
              |||| :
Db        190 KIPGAR 195



RESULT 6 
US-08-700-607-3 

; Sequence 3, Application US/08700607 

; Patent No. 5858708 

; GENERAL INFORMATION: 

; APPLICANT: Bandman, Olga 

APPLICANT: Au-Young, Janice

APPLICANT: Goli, Surya K. 
; APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 

NUMBER OF SEQUENCES: 9 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

; STREET: 3174 Porter Drive 

; CITY: Palo Alto 

; STATE: CA 

; COUNTRY: U.S. 

; ZIP: 94304 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 3: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 241 amino acids 

; TYPE: amino acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01

CLONE: 31870 



US-08-700-607-3 



Query Match 30.1%; Score 548.1; DB 2; Length 241; 

Best Local Similarity 44.4%; Pred. No. 2e-17; 

Matches 111; Conservative 40; Mismatches 60; Indels 39; Gaps 3 

Qy 109 ERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRG 168 

III:::: I I I M I II I 
Db 3 ERXAATQSHSISSSSF GAEPSAPGGGGSPGACPA LGTKSCS 43 

Qy 169 SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 228

I : I II: llhllll III : I : I I I I I I : : I I : I : II I I I I I I I I I I I I I 
Db 44 SSCAVHDLIXWRDVKKTGFVFGTTLIMLLSLAAFSVISVVSYLILALLSVTISFRIYKSV 103

Qy 229 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 288 

I I I : I I I : I I I I I : I I I : : : : I I I I : I : I : I : I : I I I I I : I I II I I I 

Db 104 IQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLK 163

Qy 289 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKI 348

I I I I : I I I I I : I I I : II I I I I : : I : I : : I : : I : 

Db 164 LAVFMWLMTYVGAVFNGITLLILAELLIXSVPIVYXKY KV 203 

Qy 349 QAKIPGLKRK 358 

: I I : : I 

Db 204 PSKTPWNRQK 213



RESULT 7 

US-09-149-476-563 

; Sequence 563, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149,476 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US98/04493 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,333 

EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038,621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,336 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,615 
; EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 



EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,862 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,887 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,908 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/048,964 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/057,650 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/056,884 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,669 
EARLIER FILING DATE: 1997-09-05 
EARLIER APPLICATION NUMBER: 60/049,610 
EARLIER FILING DATE: 1997-06-13 
EARLIER APPLICATION NUMBER: 60/061,060 



; EARLIER FILING DATE: 1997-10-02 



Query Match 29.6%; Score 539.9; DB 4; Length 168; 

Best Local Similarity 59.3%; Pred. No. 2.4e-17; 

Matches 99; Conservative 36; Mismatches 31; Indels 1; Gaps 1; 

Qy        195 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 254
              :||||  ||::|| :|: |||||||||||||| ||||:|||:|||||:|||: :: :| |
Db          1 MLLSLAAFSVISVVSYLILALLSVTISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSE 60

Qy        255 LVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 314
                  | |:|: |:|  :| : |||||:||||||| || ||: |||||:|||:|||||| :
Db         61 AFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAEL 120

Qy        315 SLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGL-KRKAD 360
               :||:|::||::: |||||:|:|    |  : |||||:||: |:||:
Db        121 LIFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKIQAKLPGIAKKKAE 167



RESULT 8 

US-09-149-476-411 

; Sequence 411, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al.

; TITLE OF INVENTION: 186 Human Secreted proteins 
FILE REFERENCE: PZ002P1 

CURRENT APPLICATION NUMBER: US/09/149,476
; CURRENT FILING DATE: 1998-09-08 
; EARLIER APPLICATION NUMBER: PCT/US98/04493
; EARLIER FILING DATE: 1998-03-06 
; EARLIER APPLICATION NUMBER: 60/040,162 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,333 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/038,621 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,626 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,336 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,615 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,597 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,502 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,633 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,583 
; EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 



EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 



; EARLIER APPLICATION NUMBER: 60/047,586 

EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,590 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,594 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,589 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,593 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,614 

; EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/043,578 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/043,576 

; EARLIER FILING DATE: 1997-04-11 

EARLIER APPLICATION NUMBER: 60/047,501 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,670 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/056,632 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,664 

EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,876 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,881 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,909 

; EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/056,875 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,862 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,887 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,908 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/048,964 

; EARLIER FILING DATE: 1997-06-06 

; EARLIER APPLICATION NUMBER: 60/057,650 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/056,884 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/057,669 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/049,610 

; EARLIER FILING DATE: 1997-06-13 

; EARLIER APPLICATION NUMBER: 60/061,060 

; EARLIER FILING DATE: 1997-10-02 

Query Match 15.7%; Score 286; DB 4; Length 92; 

Best Local Similarity 56.0%; Pred. No. 2.5e-06; 

Matches 51; Conservative 21; Mismatches 19; Indels 0; Gaps 0; 

Qy        261 NSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIP 320
              |:|: |:|  :| : |||||:||||||| || ||: |||||:|||:|||||| : :||:|
Db          2 NAAMVHINRALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAELLIFSVP 61

Qy        321 VIYERHQVQIDHYLGLANKSVKDAMAKIQAK 351
              ::||::: |||||:|:|    |  : || :|
Db         62 IVYEKYKTQIDHYVGIARDQTKSIVEKIPSK 92



RESULT 9 

US-09-621-976-4600 

Sequence 4600, Application US/09621976 
Patent No. 6639063 
GENERAL INFORMATION: 
APPLICANT: Dumas Milne Edwards, J.B. 
APPLICANT: Jobert, S. 
APPLICANT: Giordano, J.Y. 

TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
FILE REFERENCE: GENSET . 054PR2 
CURRENT APPLICATION NUMBER: US/ 09/ 62 1 , 97 6 
CURRENT FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 19335 
SOFTWARE: Patent. pm 
SEQ ID NO 4600 
LENGTH: 7 5 
TYPE: PRT 

ORGANI SM : Homo s api ens 
FEATURE: 

NAME/KEY: UNSURE 
LOCATION: 58 

OTHER INFORMATION: Xaa = His, Pro 
NAME/KEY: UNSURE 
LOCATION: 28

OTHER INFORMATION: Xaa = Met,Val
NAME/KEY: UNSURE
LOCATION: 19

OTHER INFORMATION: Xaa = Pro,Gln
NAME/KEY: UNSURE
LOCATION: 53 

OTHER INFORMATION: Xaa = Ser,Tyr 
US-09-621-976-4600 

Query Match 14.4%; Score 262.5; DB 4; Length 75; 

Best Local Similarity 67.0%; Pred. No. 2e-05; 

Matches 59; Conservative 3; Mismatches 11; Indels 15; Gaps 3;

Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
              |||:||| ||||| |||||  |||||||  ||||||            ||||||||||||
Db          1 MEDLDQSPLVSSS-DSPPRXQPAFKYQFXREPEDEE------------EDLEELEVLERK 47

Qy         61 PAAGLSAAAV--PPAAAAPLLDFSSDSV 86
              ||||| || |   ||| |||:|| :| |
Db         48 PAAGLXAAPVXTAPAAGAPLMDFGNDFV 75



RESULT 10 

US-09-621-976-4601 

; Sequence 4601, Application US/09621976 
; Patent No. 6639063 



GENERAL INFORMATION: 
APPLICANT: Dumas Milne Edwards, J.B. 
APPLICANT: Jobert, S. 
APPLICANT: Giordano, J.Y. 

TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
FILE REFERENCE: GENSET.054PR2
CURRENT APPLICATION NUMBER: US/09/621,976
CURRENT FILING DATE: 2000-07-21
NUMBER OF SEQ ID NOS: 19335
SOFTWARE: Patent.pm
SEQ ID NO 4601 
LENGTH: 75 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE:

NAME/KEY: UNSURE
LOCATION: 58

OTHER INFORMATION: Xaa = His, Pro
NAME/KEY: UNSURE
LOCATION: 28

OTHER INFORMATION: Xaa = Met,Val
NAME/KEY: UNSURE
LOCATION: 19

OTHER INFORMATION: Xaa = Pro,Gln
NAME/ KEY: UNSURE 
LOCATION: 53 

OTHER INFORMATION: Xaa = Ser,Tyr 
US-09-621-976-4601 

Query Match 14.4%; Score 262.5; DB 4; Length 75; 

Best Local Similarity 67.0%; Pred. No. 2e-05; 

Matches 59; Conservative 3; Mismatches 11; Indels 15; Gaps 3; 

Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
              |||:||| ||||| |||||  |||||||  ||||||            ||||||||||||
Db          1 MEDLDQSPLVSSS-DSPPRXQPAFKYQFXREPEDEE------------EDLEELEVLERK 47

Qy         61 PAAGLSAAAV--PPAAAAPLLDFSSDSV 86
              ||||| || |   ||| |||:|| :| |
Db         48 PAAGLXAAPVXTAPAAGAPLMDFGNDFV 75



RESULT 11 
US-09-080-897-4 

Sequence 4, Application US/09080897 
Patent No. 5985574 
GENERAL INFORMATION: 

APPLICANT : King, Mary-Claire 
APPLICANT: Lynch, Eric D. 
APPLICANT: Lee, Ming 
APPLICANT: Morrow, Jan E. 
APPLICANT: Welcsh, Piri L. 
APPLICANT: Leon, Pedro E. 
TITLE OF INVENTION: Modulators of Actin 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 



STREET: 75 DENISE DRIVE 
; CITY: HILLSBOROUGH 

STATE : CALIFORNIA 

COUNTRY: USA 
; ZIP: 94010 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30

; CURRENT APPLICATION DATA:

; APPLICATION NUMBER: US/09/080,897

FILING DATE: 

CLASSIFICATION: 
; ATTORNEY/AGENT INFORMATION: 
; NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 

REFERENCE/ DOCKET NUMBER: UW97-001 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (650) 343-4341 

TELEFAX: (650) 343-4342 
; INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 1255 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
; TOPOLOGY : linear 

; MOLECULE TYPE: peptide 
US-09-080-897-4 



Query Match 13.9%; Score 253.2; DB 2; Length 1255; 

Best Local Similarity 16.7%; Pred. No. 0.012; 

Matches 142; Conservative 43; Mismatches 126; Indels 538; Gaps 36;

Qy 2 EDIDQSSLVSSSTDSPPRPPPAFKYQFV TEPEDEED 37 

I : I I : I I III:: I I I : I 

Db 427 ECVSQIVLHKNGTD PDFKCRHLQIDIERLVDQMIDKTKVEKSEAKATELEKKLD 480 

Qy 38 EE EEEDEEEDDEDLE ELEV 56

I | | | : : | | : III 

Db 481 SELTARHELQVEMKKMENDFEQKLQDLQGEKDALDSEKQQITAQKQDLEAEVSKLTGEVA 540

Qy 57 LE--RKPAAGLSA AAVP PAAAAPLLDFSSDSV PPAPRG 92 

||: III! I I I I I III I : I I I I I 

Db 541 KL S KELEDAKN EMAS L S AVWAP S VS S S AAVP P APPLPGDSGTVI PPPPPPPPLPGG 597 

Qy 93 PLPAAPPAAP ERQPSWERSPAAPAP SLPP 121 

: I : I I I I I I I I I : I I 

Db 598 WPPSPPLPPGTCIPPPPPLPGGACIPPPPQLPGSAAIPPPPPLPGVASIPPPPPLPGAT 657 

Qy 122 AAAVLP SKLPEDDEP PARPPPPPPAGA 148

I |: I : :| I I MINI 

Db 658 AIPPPPPLPGATAIPPPPPLPGGTGIPPPPPPLPGSVGVPPPPPLPGGPGLPPPPPPFPG 717 

Qy 149 SP LAEPAAPP STPAAP KRRGSGSSW — DL — 176 

: I : I I I MM II : I I II 

Db 718 APGIPPPPPGMGVPPPPPFGFGVPAAPVLPFGLTPKKVYKPEVQLRRPNWSKFVAEDLSQ 777 



Qy 177 -LYWRDIK 183 

: I : I 

Db 778 DCFWTKVKEDRFENNELFAKLTLAFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDS 837

Qy 184 KT 185

I I 

Db 838 KTAQNLSIFLGSFRMPYQEIKNVILEWEAVLTESMIQNLIKQMPEPEQLKMLSELKEEY 897 

Qy 186 GWFG ASLFLLLSLTVFS IVSVTA 209

I I I I I I I I II II I I I I 

Db 8 98 DDLAE S EQ FGWMGT VP RL RP RLNAI L FKL QFSEQVENIKPEIVSVTAACEELRKS 953 

Qy 210 YIALALL SVTISF RIYKGVIQAI QKSDEG 238 

: I I I III III: : : 

Db 954 ENFSSLLELTLLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSADQKMTLLHFLAELCEND 1013 

Qy 239 HP FRAYLESEVAISEELVQK 258 

II I :: I : I I : I I 

Db 1014 HPEVLKFPDELAHVEKASRVSAENLQKSLDQMKKQIADVERDVQNFPAATDEKDKFVEKM 1073 

Qy 259 YSNSALGHVN — STIKEL RRLFL 279 

I : : I I : I I I I : I I 

Db 1074 TSFVKDAQEQYNKLRMKHSNMETLYKELGDYFVFDPKKLSVEEFFMDLHNFRNMFLQAVK 1133 

Qy 280 VDDLVDSLKFAV 291

: I I : : : I : 

Db 1134 ENQKRRETEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGVMDSLLEALQ 1190 

Qy 292 LMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSV KDA 344 

Ml : | | : : I I I : II 

Db 1191 SGAAFR RKRGPRQVNRKAGCAVT S LLAS ELTKDDA 1225 

Qy 345 MAKIQAKIP 353 

M I : I 

Db 1226 MAPGPVKVP 1234



RESULT 12 
US-08-899-595-1 

; Sequence 1, Application US/08899595 
; Patent No. 6111072 

GENERAL INFORMATION: 

APPLICANT: Narumiya, Shuh 
; APPLICANT: Takahashi, Nobuaki

; TITLE OF INVENTION: RHO TARGET PROTEIN HUMAN MDIA AND GENE 

; TITLE OF INVENTION: ENCODING SAME 

NUMBER OF SEQUENCES: 14 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Foley & Lardner 

; STREET: 3000 K Street, N.W., Suite 500

; CITY: Washington

; STATE: D.C. 

COUNTRY: USA 
; ZIP : 20007-5109 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 



; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS

; SOFTWARE: PatentIn Release #1.0, Version #1.30

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/899,595

FILING DATE: 24-JUL-1997 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 8-242701 

FILING DATE: 26-AUG-1996 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: JP 9-90170 

FILING DATE: 25-MAR-1997 
; ATTORNEY/AGENT INFORMATION: 

NAME: Stephen A. Bent 
; REGISTRATION NUMBER: 29,768 

REFERENCE/DOCKET NUMBER: 049441/0112 
; TELECOMMUNICATION INFORMATION: 

; TELEPHONE: (202) 672-5300

TELEFAX: (202) 672-5399

TELEX: 904136 
; INFORMATION FOR SEQ ID NO: 1: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 1255 amino acids 
; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
US-08-899-595-1 



Query Match 13.9%; Score 253.2; DB 3; Length 1255; 

Best Local Similarity 16.7%; Pred. No. 0.012; 

Matches 142; Conservative 43; Mismatches 126; Indels 538; Gaps 36; 

Qy 2 EDIDQSSLVSSSTDSPPRPPPAFKYQFV TEPEDEED 37 

I : I I : i I Ml:: I I I : I 

Db 427 ECVSQIVLHKNGTD PDFKCRHLQIDIERLVDQMIDKTKVEKSEAKATELEKKLD 480 

Qy 38 EE EEEDEEEDDEDLE ELEV 56 

I I I I : : I I : I I I 

Db 481 SELTARHELQVEMKKMENDFEQKLQDLQGEKDALDSEKQQITAQKQDLEAEVSKLTGEVA 540 

Qy 57 LE--RKPAAGLSA AAVPPAAAAPLLDFSSDSV PPAPRG 92 

||: I I I I Mill III I : I I I I I 

Db 541 KL S KEL EDAKN EMAS L SAVWAP S VS S S AAVP P APPLPGDSGTVI PPPPPPPPLPGG 597 

Qy 93 PLPAAPPAAP ERQP SWERS PAAPAP SLPP 121 

: | : || I I I I I I I'M 

Db 598 WPPSPPLPPGTCIPPPPPLPGGACIPPPPQLPGSAAIPPPPPLPGVASIPPPPPLPGAT 657 

Qy 122 AAAVLP SKLPEDDEP PARPPPPPPAGA 148 

| | : | : : I I I I I I I I I 

Db 658 AIPPPPPLPGATAIPPPPPLPGGTGIPPPPPPLPGSVGVPPPPPLPGGPGLPPPPPPFPG 717 

Qy 149 SP LAEPAAPP STPAAP KRRGSGSSW — DL-- 176 

: I : I I I I I I I I I : I I I I 

Db 718 APGIPPPPPGMGVPPPPPFGFGVPAAPVLPFGLTPKKVYKPEVQLRRPNWSKFVAEDLSQ 777 



Qy 177 -LYWRDIK 183 

: I : I 

Db 778 DCFWTKVKEDRFENNELFAKLTLAFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDS 837

Qy 184 KT 185 

I I 

Db 838 KTAQNLSIFLGSFRMPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKEEY 897

Qy 186 GWFG ASLFLLLSLTVFS IVSVTA 209 

I M I I I I I II I I 1 I I I 

Db 898 DDLAESEQFGWMGTVPRLRPRLNAI LFKL QFSEQVENIKPEIVSVTAACEELRKS 953 

Qy 210 YIALALL SVTISF RIYKGVIQAI QKSDEG 238

: | || III |||: : : 

Db 954 ENFSSLLELTLLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSADQKMTLLHFLAELCEND 1013 

Qy 239 HP FRAYLESEVAISEELVQK 258 

II i : : I : I I : I I 

Db 1014 HPEVLKFPDELAHVEKASRVSAENLQKSLDQMKKQIADVERDVQNFPAATDEKDKFVEKM 1073 

Qy 259 YSNSALGHVN— STIKEL RRLFL 279 

I : : I I : I I I I : I I 

Db 1074 TSFVKDAQEQYNKLRMMHSNMETLYKELGDYFVFDPKKLSVEEFFMDLHNFRNMFLQAVK 1133 

Qy 280 VDDLVDSLKFAV 291

:| |:::h 

Db 1134 ENQKRRETEEKMRRAKLAKEKAEKERLEKQQKREQLI DMNAEGDETGVMDS LLEALQ 1190 

Qy 292 LMWVFTYVGALFNGLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKSV KDA 344 

Ml : | | : : I I I : II 

Db 1191 SGAAFR RKRGPRQVNRKAGCAVT S LLAS ELTKDDA 1225 

Qy 345 MAKIQAKIP 353 

II I : I 

Db 1226 MAPGPVKVP 1234 

RESULT 13 
US-09-323-735-4 

Sequence 4, Application US/09323735 
Patent No. 6197932 
GENERAL INFORMATION: 

APPLICANT: King, Mary-Claire 
APPLICANT : Lynch, Eric D. 
APPLICANT: Lee, Ming 
APPLICANT: Morrow, Jan E. 
APPLICANT: Welcsh, Piri L. 
APPLICANT: Leon, Pedro E. 
TITLE OF INVENTION: Modulators of Actin 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 75 DENISE DRIVE 
CITY: HILLSBOROUGH 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94010 



; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: PatentIn Release #1.0, Version #1.30

CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/323,735
; FILING DATE: 

; CLASSIFICATION: 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/080,897 
; FILING DATE: 

ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: UW97-001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (650) 343-4341 
; TELEFAX: (650) 343-4342 

INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 1255 amino acids

; TYPE: amino acid 

STRANDEDNESS: single 
; TOPOLOGY: linear 

; MOLECULE TYPE: peptide 
US-09-323-735-4 

Query Match 13.9%; Score 253.2; DB 3; Length 1255; 

Best Local Similarity 16.7%; Pred. No. 0.012; 

Matches 142; Conservative 43; Mismatches 126; Indels 538; Gaps 36;



Qy 2 EDIDQSSLVSSSTDSPPRPPPAFKYQFV- TEPEDEED 37

Db 427 ECVSQIVLHKNGTD

Qy 57 LE--RKPAAGLSA AAVPPAAAAPLLDFSSDSV PPAPRG 92

||: I I I I I I I II III I : I M I I 

Db 541 KLSKELEDAKNEMASLSAWVAPSVSSSAAVPP APPLPGDSGTVIPPPPPPPPLPGG 597

Qy 93 PLPAAPPAAP ERQPSWERSPAAPAP SLPP 121

: I : I I I I I I I I 1:11 

Db 598 WPPSPPLPPGTCIPPPPPLPGGACIPPPPQLPGSAAIPPPPPLPGVASIPPPPPLPGAT 657

Qy 122 AAAVLP SKLPEDDEP PARPPPPPPAGA 148

| | : | : : I I I I I I I I I 

Db 658 AIPPPPPLPGATAIPPPPPLPGGTGIPPPPPPLPGSVGVPPPPPLPGGPGLPPPPPPFPG 717

Qy 149 SP LAEPAAPP STPAAP KRRGSGSSW--DL-- 176

. i i t i i i i i 

Db 718 APGIPPPPPGMGVPPPPPFGFGVPAAPVLPFGLTPKKVYKPEVQLRRPNWSKFVAEDLSQ 777

Db 778 DCFWTKVKEDRFENNELFAKLTLAFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDS 837

Qy 184 KT 185

I I 

Db 838 KTAQNLSIFLGSFRMPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKEEY 897 

Qy 186 GWFG ASLFLLLSLTVFS IVSVTA 209 

I II I I I I I II I I I I I I 

Db 898 DDLAESEQFGWMGTVPRLRPRLNAILFKL QFSEQVENIKPEIVSVTAACEELRKS 953

Qy 210 YIALALL SVTISF RIYKGVIQAI QKSDEG 238 

: I I I III III: : : 

Db 954 ENFSSLLELTLLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSADQKMTLLHFLAELCEND 1013 

Qy 239 HP FRAYLESEVAISEELVQK 258 

II I : : I : I I : I I 

Db 1014 HPEVLKFPDELAHVEKASRVSAENLQKSLDQMKKQIADVERDVQNFPAATDEKDKFVEKM 1073 

Qy 259 YSNSALGHVN — STIKEL -RRLFL 279 

I : : I I : I I I I : I I 

Db 1074 TSFVKDAQEQYNKLRMMHSNMETLYKELGDYFVFDPKKLSVEEFFMDLHNFRNMFLQAVK 1133 

Qy 280 VDDLVDSLKFAV 291

: I I : : : I : 

Db 1134 ENQKRRETEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGVMDSLLEALQ 1190 

Qy 292 LMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGIANKSV KDA 344 

Ml : I I : : I I I : II 

Db 1191 SGAAFR RKRGPRQVNRKAGCAVTSLLASELTKDDA 1225 

Qy 345 MAKIQAKIP 353 

II I : I 

Db 1226 MAPGPVKVP 1234 



RESULT 14 
US-09-080-897-2 

Sequence 2, Application US/09080897 
Patent No. 5985574 
GENERAL INFORMATION: 



APPLICANT: King, Mary-Claire 

APPLICANT: Lynch, Eric D. 

APPLICANT: Lee, Ming 

APPLICANT: Morrow, Jan E. 

APPLICANT: Welcsh, Piri L. 

APPLICANT: Leon, Pedro E. 

TITLE OF INVENTION: Modulators of Actin 

NUMBER OF SEQUENCES: 14 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 75 DENISE DRIVE 

CITY: HILLSBOROUGH 

STATE: CALIFORNIA 

COUNTRY: USA 

ZIP: 94010 
COMPUTER READABLE FORM: 



MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/080,897
FILING DATE: 
CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: UW97-001 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (650) 343-4341 
TELEFAX: (650) 343-4342 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1248 amino acids
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-080-897-2 

Query Match 13.9%; Score 252.5; DB 2; Length 1248; 

Best Local Similarity 15.3%; Pred. No. 0.012; 

Matches 140; Conservative 46; Mismatches 102; Indels 625; Gaps 39; 

Qy 1 MEDIDQ — SSLVSSSTDSPPRP 20 

I : I : : I : : : II I 

Db 378 MDDFNEVFQILLNTVKDSKAEPHFLSI LQHLLLVRNDYEARPQYYKLI EECI SQIVLHKN 437 

Qy 21 — PPAFKY QFVTEPEDEEDEEE E 41 

III | : : : | : | : I 

Db 438 GADPDFKCRHLQIEIEGLIDQMIDKTKVEKSEAKAAELEKKLDSELTARHELQVEMKKME 497

Qy 42 EDEEEDDEDLE ELEVLE RKPAAGLS 66 

I I : : I I : III: : I I I I 

Db 498 SDFEQKLQDLQGEKDALHSEKQQIATEKQDLEAEVSQLTGEVAKLTKELEDAKKEMASLS 557

Qy 67 AAA--VPPA--AAAPLLDFSSDSVPPAPRGPLPA APPAAP--ERQPSWERSPAA 114

I I I I I I :: I I I I I I I I II I I I I I I 

Db 558 AAAITVPPSVPSRAP VPPAP--PLPGDSGTIIPPPPAPGDSTTPPPPPPPPP 607

Qy 115 PAPSLPPAAAVLP SKLP 131 

I I I I I : I III 
Db 608 PPPPLPGGTAISPPPPLSGDATIPPPPPLPEGVGIPSPSSLPGGTAIPPPPPLPGSARIP 667 

Qy 132 EDDEPPARPP PPPPAGASPL 151 

I I I I I Mill: 
Db 668 PPPPPLPGSAGIPPPPPPLPGEAGMPPPPPPLPGGPGIPPPPPFPGGPGIPPPPPG M 724 

Qy 152 AEPAAPP STPAAP KRRGSGSSW--DL LYWRDIK 183 

III I I I I II : I : I II : I : I 

Db 725 GMPPPPPFGFGVPAAPVLPFGLTPKKLYKPEVQLRRPNWSKLVAEDLSQDCFWTKVKEDR 784 



Qy 184 KT 185

I I 

Db 785 FENNELFAKLTLTFSAQTKTKKDQEGGEEKKSVQKKKVKELKVLDSKTAQNLSIFLGSFR 844



Qy 186 GWFG 190 

I I I I 

Db 845 MPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKDEYDDLAESEQFGWMG 904 

Qy 191 ASLFLLLSLTVFS IVSVTAYIALA LLSVT 219 

I I I I II I I I I I I I I I : I 

Db 905 TVPRLRPRLNAI LFKL QFSEQVENIKPEIVSVTA ACEELRKSESFSNLLEIT 956 

Qy 220 I SFRI YKGVIQAIQK 234 

: II II:: 
Db 957 LLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSTDQKMTLLHFLAELCENDYPDVLKF 1013 

Qy 235 SDEGHPFRAYLESEVAISEELVQK 258 

II I : : I : I I : I I 

Db 1014 PDE LAHVEKASRVSAENLQKNLDQMKKQISDVERDVQNFPAATDEKDKFVEKMTSF 1069 

Qy 259 YSNSALGHVN — STIKEL RRLFL 279 

I : : I I : I I I I : I I 

Db 1070 VKDAQEQYNKLRMMHSNMETLYKELGEYFLFDPKKLSVEEFFMDLHNFRNMFLQAVKENQ 1129

Qy 280 VDDLVDSLKFAVLMW 294

: I I : : : I : 

Db 1130 KRRKTEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGVMDSLLEALQ 1183 

Qy 295 VFTYVGALFN GLTLLI LALI SLFS I PVI YERHQVQI DHYLGLANKS 340 

III : I I II:: 
Db 1184 S GAAFRRKRG P RQAN RKAGCAVT S L LAS EL 1213 

Qy 341 VK-DAMAKIQAKI 352 

I I I I I : I I : 
Db 1214 TKDDAMAAVPAKV 1226 



RESULT 15 
US-09-323-735-2 

Sequence 2, Application US/09323735 
Patent No. 6197932 
GENERAL INFORMATION: 

APPLICANT: King, Mary-Claire 
APPLICANT: Lynch, Eric D. 
APPLICANT: Lee, Ming 
APPLICANT: Morrow, Jan E. 
APPLICANT: Welcsh, Piri L. 
APPLICANT: Leon, Pedro E. 
TITLE OF INVENTION: Modulators of Actin 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 75 DENISE DRIVE 
CITY : HILLSBOROUGH 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94010 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 



COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/323,735

FILING DATE: 

CLASSIFICATION: 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 09/080,897 

FILING DATE: 
; ATTORNEY/AGENT INFORMATION: 

NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 

REFERENCE/DOCKET NUMBER: UW97-001 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (650) 343-4341 
; TELEFAX : (650) 343-4342 

; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 1248 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-09-323-735-2 



Query Match 13.9%; Score 252.5; DB 3; Length 1248;

Best Local Similarity 15.3%; Pred. No. 0.012;

Matches 140; Conservative 46; Mismatches 102; Indels 625; Gaps 39;

Qy 1 MEDIDQ — SSLVSSSTDSPPRP 20 

I : I : : I : : : II I 

Db 378 MDDFNEVFQILLNTVKDSKAEPHFLSILQHLLLVRNDYEARPQYYKLIEECISQIVLHKN 437 

Qy 21 — PPAFKY QFVTEPEDEEDEEE E 41 

III | : : : | : | : I 

Db 438 GADPDFKCRHLQIEIEGLIDQMIDKTKVEKSEAKAAELEKKLDSELTARHELQVEMKKME 497 

Qy 42 EDEEEDDEDLE ELEVLE RKPAAGLS 66 

II: : I I : III: : I I I I 

Db 498 SDFEQKLQDLQGEKDALHSEKQQIATEKQDLEAEVSQLTGEVAKLTKELEDAKKEMASLS 557

Qy 67 AAA--VPPA--AAAPLLDFSSDSVPPAPRGPLPA APPAAP--ERQPSWERSPAA 114

III III: : I I I I I I I III I I II I I 

Db 558 AAAITVPPSVPSRAP VPPAP--PLPGDSGTIIPPPPAPGDSTTPPPPPPPPP 607



Qy 115 PAPSLPPAAAVLP SKLP 131 

I I I I I : I III 
Db 608 PPPPLPGGTAISPPPPLSGDATIPPPPPLPEGVGIPSPSSLPGGTAIPPPPPLPGSARIP 667 



Qy 132 EDDEPPARPP PPPPAGASPL 151 

I I I I I Mill: 
Db 668 PPPPPLPGSAGIPPPPPPLPGEAGMPPPPPPLPGGPGIPPPPPFPGGPGIPPPPPG M 724 



Qy 152 AEPAAPP STPAAP KRRGSGSSW — DL LYWRDIK 183 

III I I I I II : I : I II : I : I 

Db 725 GMPPPPPFGFGVPAAPVLPFGLTPKKLYKPEVQLRRPNWSKLVAEDLSQDCFWTKVKEDR 784



Qy 184 KT 185 

I I 

Db 785 FENNELFAKLTLTFSAQTKTKKDQEGGEEKKSVQKKKVKELKVLDSKTAQNLSIFLGSFR 844 

Qy 186 GWFG 190

III I 

Db 845 MPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKDEYDDLAESEQFGWMG 904 

Qy 191 ASLFLLLSLTVFS IVSVTAYIALA LLSVT 219 

I I I I II I I I I I I I I I : I 

Db 905 TVPRLRPRLNAILFKL QFSEQVENIKPEIVSVTA ACEELRKSESFSNLLEIT 956 

Qy 220 I SFRI YKGVIQAIQK 234 

: II II:: 
Db 957 LLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSTDQKMTLLHFLAELCENDYPDVLKF 1013 

Qy 235 SDEGHPFRAYLESEVAI SEELVQK 258 

II I :: I : I I : I I 

Db 1014 PDE LAHVEKASRVSAENLQKNLDQMKKQISDVERDVQNFPAATDEKDKFVEKMTSF 1069 

Qy 259 YSNSALGHVN--STIKEL RRLFL 279 

I : : I I : I I I I : I I 

Db 1070 VKDAQEQYNKLRMMHSNMETLYKELGEYFLFDPKKLSVEEFFMDLHNFRNMFLQAVKENQ 1129 

Qy 280 VDDLVDSLKFAVLMW 294

: | | : : : | : 

Db 1130 KRRKTEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGVMDSLLEALQ 1183 

Qy 295 VFTYVGALFN GLTLLI LALI SLFS I PVI YERHQVQI DHYLGIANKS 340 

III : I I II:: 
Db 1184 S GAAFRRKRGP RQAN RKAGCAVT S L LAS EL 1213 

Qy 341 VK-DAMAKIQAKI 352

I I I I I : II: 
Db 1214 TKDDAMAAVPAKV 1226



Search completed: September 29, 2004, 18:56:35 
Job time : 31.9674 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on:          September 29, 2004, 18:54:54 ; Search time 85.8807 Seconds
                 (without alignments)
                 1348.937 Million cell updates/sec

Title:           SEQ2_1-171_975-1163
Perfect score:   1823
Sequence:        1 MEDIDQSSLVSSSTDSPPRP...VKDAMAKIQAKIPGLKRKAD 360

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.1

Searched:        1351062 seqs, 321799191 residues

Total number of hits satisfying chosen parameters:    1351062

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :       Published_Applications_AA:*
                 1:  /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
                 2:  /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
                 3:  /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
                 4:  /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
                 5:  /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
                 6:  /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
                 7:  /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
                 8:  /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
                 9:  /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
                 10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
                 11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
                 12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
                 13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
                 14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
                 15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
                 16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
                 17: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:*
                 18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
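
The throughput figure in the run header appears to be the number of dynamic-programming cells (query length times database residues) divided by the search time. The report does not state this definition, so the Python sketch below treats it as an assumption and simply checks it against the header values; the function name is illustrative.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Millions of alignment-matrix cells filled per second (assumed definition)."""
    return query_len * db_residues / seconds / 1e6


# Values taken from the run header: a 360-residue query, 321799191 database
# residues, and a search time of 85.8807 seconds.
rate = million_cell_updates_per_sec(360, 321_799_191, 85.8807)
print(f"{rate:.2f} Million cell updates/sec")
```

Under that assumption the result agrees with the reported 1348.937 Million cell updates/sec to within rounding of the printed search time.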

SUMMARIES 



Result                      Query
   No.        Score        Match       Length  DB  ID                      Description

     1         1817         99.7          360   9  US-09-893-348-20        Sequence 20, Appl
     2       1808.1         99.2          379  14  US-10-205-194-164       Sequence 164, App
     3       1732.7         95.0         1163   9  US-09-893-348-18        Sequence 18, Appl
     4       1622.9         89.0         1163  12  US-10-267-502-431       Sequence 431, App
     5       1603.3         87.9          373   9  US-09-789-386-6         Sequence 6, Appli
     6       1603.3         87.9          373   9  US-09-765-205-6         Sequence 6, Appli
     7       1603.3         87.9          373   9  US-09-893-348-24        Sequence 24, Appl
     8       1603.3         87.9          373  12  US-10-408-967-8         Sequence 8, Appli
     9       1603.3         87.9          373  14  US-10-060-036-72        Sequence 72, Appl
    10       1595.3         87.5          373  16  US-10-466-258-4         Sequence 4, Appli
    11       1518.4         83.3         1192   9  US-09-789-386-2         Sequence 2, Appli
    12       1518.4         83.3         1192   9  US-09-758-140-6         Sequence 6, Appli
    13       1518.4         83.3         1192   9  US-09-893-348-23        Sequence 23, Appl
    14       1518.4         83.3         1192   9  US-09-972-599A-6        Sequence 6, Appli
    15       1518.4         83.3         1192  12  US-10-267-502-429       Sequence 429, App
    16       1518.4         83.3         1192  14  US-10-060-036-71        Sequence 71, Appl
    17       1518.4         83.3         1192  16  US-10-327-213-9         Sequence 9, Appli
    18       1518.4         83.3         1192  16  US-10-466-258-9         Sequence 9, Appli
    19       1511.4         82.9  [illegible]  12  US-10-408-967-7         Sequence 7, Appli
    20  [illegible]  [illegible]          199   9  US-09-893-348-21        Sequence 21, Appl
    21          904         49.6          199   9  US-09-893-348-25        Sequence 25, Appl
    22          904         49.6          199  12  US-10-660-946-1         Sequence 1, Appli
    23          897         49.2          199  12  US-10-408-967-9         Sequence 9, Appli
    24          895         49.1          199  12  US-09-978-360A-467      Sequence 467, App
    25          864         47.4          199  16  US-10-466-258-11        Sequence 11, Appl
    26        826.3         45.3          776  12  US-10-660-946-5         Sequence 5, Appli
    27        826.3         45.3          776  12  US-10-267-502-430       Sequence 430, App
    28        823.2         45.2          780  12  US-10-267-502-432       Sequence 432, App
    29        807.4         44.3          111  14  US-10-205-219-93        Sequence 93, Appl
    30          790         43.3          356  12  US-10-660-946-6         Sequence 6, Appli
    31        687.7         37.7          593  15  US-10-108-260A-2892     Sequence 2892, Ap
    32          683         37.5          208  12  US-10-660-946-7         Sequence 7, Appli
    33          679         37.2          266  12  US-10-276-774-2330      Sequence 2330, Ap
    34        672.9         36.9          269  14  US-10-106-698-6222      Sequence 6222, Ap
    35  [illegible]  [illegible]          267  12  US-10-660-946-8         Sequence 8, Appli
    36  [illegible]         36.7          267  14  US-10-205-194-127       Sequence 127, App
    37        654.6         35.9          236   9  US-09-729-674-20        Sequence 20, Appl
    38        654.6         35.9          236   9  US-09-765-205-26        Sequence 26, Appl
    39        654.6         35.9          236  12  US-10-408-967-2         Sequence 2, Appli
    40        548.1         30.1          241  12  US-10-660-946-3         Sequence 3, Appli
    41        539.9         29.6          168  10  US-09-809-391-563       Sequence 563, App
    42        539.9         29.6          168  10  US-09-882-171-563       Sequence 563, App
    43        539.9         29.6          168  12  US-10-164-861-563       Sequence 563, App
    44        527.7         28.9          222  12  US-10-267-502-428       Sequence 428, App
    45        519.6         28.5          234  12  US-10-424-599-200840    Sequence 200840,



ALIGNMENTS 



RESULT 1 

US-09-893-348-20 

; Sequence 20, Application US/09893348 
; Patent No. US20020072493A1
; GENERAL INFORMATION:
; APPLICANT: EISENBACH-SCHWARTZ, Michal
; APPLICANT: COHEN, Irun R.
; APPLICANT: BESERMAN, Pierre
; APPLICANT: MOSONEGO, Alon
; APPLICANT: MOALEM, Gila
; TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND THEIR USES
; FILE REFERENCE: EIS-SCHWARTZ=2A
; CURRENT APPLICATION NUMBER: US/09/893,348
; CURRENT FILING DATE: 2001-06-28
; PRIOR APPLICATION NUMBER: US 09/314,161
; PRIOR FILING DATE: 1999-05-19
; PRIOR APPLICATION NUMBER: US 09/218,277
; PRIOR FILING DATE: 1998-12-22
; PRIOR APPLICATION NUMBER: PCT/US98/14715
; PRIOR FILING DATE: 1998-07-21
; PRIOR APPLICATION NUMBER: IL 124500
; PRIOR FILING DATE: 1998-05-19
; NUMBER OF SEQ ID NOS: 29
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 20 
LENGTH: 360 
TYPE: PRT 
; ORGANISM: Rattus norvegicus 
US-09-893-348-20 

Query Match 99.7%; Score 1817; DB 9; Length 360; 

Best Local Similarity 99.7%; Pred. No. 1.8e-59; 

Matches 359; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
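
The two percentages in each hit header can be cross-checked against the other printed fields. The sketch below assumes, since the report does not define them, that Query Match is the hit score divided by the perfect score (1823 for this query) and that Best Local Similarity is the number of identical columns divided by all alignment columns (matches, conservative substitutions, mismatches, and indels). The function names are illustrative.

```python
PERFECT_SCORE = 1823  # "Perfect score" from the run header


def query_match_pct(score: float, perfect_score: float = PERFECT_SCORE) -> float:
    """Hit score as a percentage of the query's self-alignment score."""
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, conservative: int,
                              mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# (score, matches, conservative, mismatches, indels) as printed for RESULT 1-3.
hits = {
    1: (1817.0, 359, 0, 1, 0),
    2: (1808.1, 359, 1, 0, 19),
    3: (1732.7, 360, 0, 0, 803),
}
for number, (score, m, c, x, i) in hits.items():
    print(number,
          round(query_match_pct(score), 1),
          round(best_local_similarity_pct(m, c, x, i), 1))
```

With these assumed definitions, the computed values round to the percentages printed for the first three hits of this search.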

Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSVVDLLYWR 180

Db

Qy        181 DIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHP 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 DIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHP 240

Qy        241 FRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 FRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVG 300

Qy        301 ALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 ALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360



RESULT 2 

US-10-205-194-164 

; Sequence 164, Application US/10205194 



Publication No. US20030134301A1 
GENERAL INFORMATION: 
APPLICANT: Warner-Lambert Company 
APPLICANT: Lee, Kevin 
APPLICANT: Dixon, Alistair 
APPLICANT: Brooksbank, Robert 
APPLICANT : Pinnock, Robert 

TITLE OF INVENTION: Identification and Use of Molecules Implicated in Pain 
FILE REFERENCE: WL-A-018201 

CURRENT APPLICATION NUMBER: US/10/205,194
CURRENT FILING DATE: 2002-07-24
PRIOR APPLICATION NUMBER: GB 0118354.0 
PRIOR FILING DATE: 2001-07-27 
NUMBER OF SEQ ID NOS : 177 
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 164 
LENGTH: 379 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
FEATURE: 

OTHER INFORMATION: Foocen-m2 reticulon 
US-10-205-194-164 

Query Match 99.2%; Score 1808.1; DB 14; Length 379; 

Best Local Similarity 94.7%; Pred. No. 4.3e-59; 

Matches 359; Conservative 1; Mismatches 0; Indels 19; Gaps 1; 

Qy          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSG---------- 170
              ||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy        171 ---------SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 221
                       |:|||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 AASEPVIPSSAVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 240

Qy        222 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVD 281
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVD 300

Qy        282 DLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSV 341
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 DLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSV 360

Qy        342 KDAMAKIQAKIPGLKRKAD 360
              |||||||||||||||||||
Db        361 KDAMAKIQAKIPGLKRKAD 379



RESULT 3 



US-09-893-348-18 

Sequence 18, Application US/09893348 
Patent No. US20020072493A1
GENERAL INFORMATION:
APPLICANT: EISENBACH-SCHWARTZ, Michal
APPLICANT: COHEN, Irun R.
APPLICANT: BESERMAN, Pierre
APPLICANT: MOSONEGO, Alon
APPLICANT: MOALEM, Gila
TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND THEIR USES
FILE REFERENCE: EIS-SCHWARTZ=2A
CURRENT APPLICATION NUMBER: US/09/893,348
CURRENT FILING DATE: 2001-06-28
PRIOR APPLICATION NUMBER: US 09/314,161
PRIOR FILING DATE: 1999-05-19
PRIOR APPLICATION NUMBER: US 09/218,277
PRIOR FILING DATE: 1998-12-22
PRIOR APPLICATION NUMBER: PCT/US98/14715
PRIOR FILING DATE: 1998-07-21
PRIOR APPLICATION NUMBER: IL 124500
PRIOR FILING DATE: 1998-05-19
NUMBER OF SEQ ID NOS: 29
SOFTWARE: PatentIn version 3.1
SEQ ID NO 18 
LENGTH: 1163 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-18 

Query Match 95.0%; Score 1732.7; DB 9; Length 1163; 

Best Local Similarity 31.0%; Pred. No. 2.2e-55; 

Matches 360; Conservative 0; Mismatches 0; Indels 803; Gaps 1; 
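
The score of this hit can be related to the gap parameters in the run header. The sketch below assumes an affine gap cost of Gapop plus Gapext for every gapped position, which the report does not spell out; it then subtracts that cost from the perfect score. For RESULT 2 it additionally assumes that the single conservative column is a Ser/Ala pair, scored with the standard BLOSUM62 values (Ser/Ser 4, Ser/Ala 1). The names are illustrative.

```python
GAP_OPEN = 10.0         # "Gapop" from the run header
GAP_EXTEND = 0.1        # "Gapext" from the run header
PERFECT_SCORE = 1823.0  # "Perfect score" from the run header


def gap_cost(length: int) -> float:
    """Affine cost of one gap; every gapped position pays the extension (assumed)."""
    return GAP_OPEN + GAP_EXTEND * length


# RESULT 3: all 360 query residues match, with one gap of 803 positions.
result3 = PERFECT_SCORE - gap_cost(803)

# RESULT 2: one gap of 19 positions plus one Ser/Ala column, which loses
# 4 - 1 = 3 points relative to an identical Ser/Ser column in BLOSUM62.
result2 = PERFECT_SCORE - (4 - 1) - gap_cost(19)

print(round(result3, 1), round(result2, 1))
```

Under these assumptions the two values come out at the scores printed for RESULT 3 and RESULT 2 (1732.7 and 1808.1).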

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I II 1 I I I I I I I I I I I I M I I I I I I I II II I I I I I I I I I I I I I I I I 1 I I I I I I I I I I 1 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I 
Db 61 P AAGL S AAAVP P AAAAP LL D FS S D S VP PAP RGP L P AAP P AAP ERQ P S WE RS P AAP AP S L P 120 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGS 171 

I I I I I I I I I I I I I I II I II I I I I I I II M I I I M I I I I I I I I I I I I I I I I I 
Db 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180 

Qy 172 171 

Db 181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240 

Qy 172 171 

Db 241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300 

Qy 172 171 



Db 301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRWSPEKTMDIFNEMQMSWA 360

Qy 172 171

Db 361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Qy 172 171 

Db 421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy 172 171 

Db 481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy 172 171

Db 541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600 

Qy 172 171 

Db 601 LPSAGASWQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660 

Qy 172 171 

Db 661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720 

Qy 172 171 

Db 721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780 

Qy 172 171 

Db 781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840 

Qy 172 171 

Db 841 IIDEFPTFVSAKDDSPKIlAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900 

Qy 172 171 

Db 901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy 172 SVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 217



Db 961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy 218 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 277

I II I I I i 1 II I I I I II I I I I I I I I I I I I I I I 1 I I I I I I II I I 1 I I I I I I I I I I I I I I M I 
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy 278 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 337

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140 

Qy 338 NKSVKDAMAKIQAKIPGLKRKAD 360 

I I I I I M I I I I I I I I I I I I I I I I 
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163 
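
The summary lines printed before each alignment can be cross-checked against one another. The sketch below parses one summary block and recomputes the two percentages. It assumes, from the figures in this listing rather than from any GenCore documentation, that "Query Match" is the score divided by the perfect score (1823 in the run header) and that "Best Local Similarity" is the number of matches divided by the total of matches, conservative substitutions, mismatches and indels. The function names are illustrative only.

```python
import re

PERFECT_SCORE = 1823.0  # "Perfect score" from the run header of this listing

# Summary block of RESULT 3, copied from the listing above.
SUMMARY = """Query Match 95.0%; Score 1732.7; DB 9; Length 1163;
Best Local Similarity 31.0%; Pred. No. 2.2e-55;
Matches 360; Conservative 0; Mismatches 0; Indels 803; Gaps 1;"""


def parse_summary(text):
    """Return {field name: numeric value} for every 'Name value;' pair."""
    fields = {}
    pattern = r"([A-Za-z][A-Za-z. ]*?)\s+([-+0-9.eE]+)%?;"
    for name, value in re.findall(pattern, text):
        fields[name.strip()] = float(value)
    return fields


def query_match_pct(score, perfect=PERFECT_SCORE):
    """Assumed relation: score as a percentage of the perfect score."""
    return 100.0 * score / perfect


def best_local_similarity_pct(fields):
    """Assumed relation: matches as a percentage of all alignment columns."""
    columns = (fields["Matches"] + fields["Conservative"]
               + fields["Mismatches"] + fields["Indels"])
    return 100.0 * fields["Matches"] / columns


fields = parse_summary(SUMMARY)
print(round(query_match_pct(fields["Score"]), 1))    # 95.0
print(round(best_local_similarity_pct(fields), 1))   # 31.0
```

Under these assumptions the recomputed values agree with the printed 95.0% and 31.0%, so a summary block that fails the check has probably been damaged in scanning.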



RESULT 4 

US-10-267-502-431 

; Sequence 431, Application US/10267502 

; Publication No. US20040071700A1 

; GENERAL INFORMATION: 

; APPLICANT: Kim, Jaeseob 

; APPLICANT: Galant, Ron 

; TITLE OF INVENTION: Obesity Linked Genes 
; FILE REFERENCE: LSD-07416 

; CURRENT APPLICATION NUMBER: US/10/267,502
; CURRENT FILING DATE: 2003-01-27 
; NUMBER OF SEQ ID NOS : 439 

SOFTWARE: PatentIn version 3.2
; SEQ ID NO 431 

LENGTH: 1163 

TYPE: PRT 
; ORGANISM: Mus musculus
US-10-267-502-431 

Query Match 89.0%; Score 1622.9; DB 12; Length 1163; 

Best Local Similarity 29.7%; Pred. No. 2.6e-51; 

Matches 347; Conservative 2; Mismatches 7; Indels 811; Gaps 5; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I : I I I I I I I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

I I I I II I III I I I I I I II I I II I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 60 PAAGLSAVPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGS 171 

I I I I I I I I I I I II I I I I I I II I I I I I I I I II I M M I I I I I I I I I I I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176



Qy 172 171 

Db 177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236 

Qy 172 171 

Db 237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296 

Qy 172 171 

Db 297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKWKEDGVMSPEKTMDIFNEMK 356

Qy 172 171 

Db 357 MSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKSHGK 416 

Qy 172 171 

Db 417 DSESRNENASFPSTPELVKDGSRAYITCDSFTSATESTAANIFPVLEDHTSENKTDEKKI 476 

Qy 172 171 



Db 477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAVVATMPEGLTPDLVQEA 536 

Qy 172 171 

Db 537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596 

Qy 172 171 

Db 597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDAKEEIK 656 

Qy 172 171 

Db 657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEKSVPDHCELVDDS 716 

Qy 172 171 

Db 717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776

Qy 172 171 

Db 777 LESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFS 836 

Qy 172 171 

Db 837 DSSPIEIIDEFPTFVSAKDDSPKEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSFKNT 896 

Qy 172 171 

Db 897 YPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPSDTE 956

Qy 172 SVVDLLYWRDIKKTGVV-FGASLFLLLSLTVFSIVSVTAYIAL 213



Db 957 KEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGVVYFGASLFLLLSLTVFSIVSVTAYIAL 1016

Qy 214 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKE 273 

I M I I I I II I I I I I ! I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I 1 I I I I I I I I I I 
Db 1017 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKE 1076 

Qy 274 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHY 333 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 1 I I M I I I I I II I I I I I I I I I 
Db 1077 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHY 1136 

Qy 334 LGLANKSVKDAMAKIQAKIPGLKRKAD 360

I I I I I I I M 1 I I I I I I I I I I I I I I II : 
Db 1137 LGLANKSVKDAMAKIQAKIPGLKRKAE 1163

RESULT 5 
US-09-789-386-6 

; Sequence 6, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS

; FILE REFERENCE: GP-30165-C1 



; CURRENT APPLICATION NUMBER: US/09/789,386

; CURRENT FILING DATE: 2001-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

; PRIOR FILING DATE: 1998-07-22 

; PRIOR APPLICATION NUMBER: US 09/359,208 

; PRIOR FILING DATE: 1999-07-22 

; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 6 

LENGTH: 373 

TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-6 

Query Match 87.9%; Score 1603.3; DB 9; Length 373; 

Best Local Similarity 87.2%; Pred. No. 1.6e-51; 

Matches 327; Conservative 11; Mismatches 20; Indels 17; Gaps 6 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I II I ! I I I I I I I I I I I II : I I :: I I I I I I I II I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I II I I I I II III I I I : I I : I I I I I I I I I I I II I I MINIM: II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

Ml M I I II I M I I M II I I I I I I I I I I I I I I M I I I II I II II 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 RG-SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 225

M Ml I I I M I I II II II I II I II I I I I I II I II M I I II II II I I I I II I I I I I I M 
Db 179 RGSSGSVVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 238

Qy 226 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVD 285 

I M I I I I II I I M I II M I I I M II II II I I I M I II I M I II I I II I I I I I M I I I I I 

Db 239 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVD 298

Qy 286 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAM 345

II I I I I I I I I I M I II II II I II I I I M I II II : I I I I I II I I II II II I I I M II I I I 

Db 299 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAM 358 

Qy 346 AKIQAKIPGLKRKAD 360

I I I I II I I I I I II I : 
Db 359 AKIQAKIPGLKRKAE 373



RESULT 6 
US-09-765-205-6 

; Sequence 6, Application US/09765205 
; Patent No. US20020034800A1 
; GENERAL INFORMATION: 
; APPLICANT: Cao, Li 

; TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 
; FILE REFERENCE: 1458.004/200130.449



; CURRENT APPLICATION NUMBER: US/09/765,205

; CURRENT FILING DATE: 2001-01-17

; PRIOR APPLICATION NUMBER: US/09/212,440

; PRIOR FILING DATE: 1998-12-16

; NUMBER OF SEQ ID NOS: 46

; SOFTWARE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 6 

; LENGTH: 373 

; TYPE: PRT 

; ORGANISM: human 

US-09-765-205-6 

Query Match 87.9%; Score 1603.3; DB 9; Length 373; 

Best Local Similarity 87.2%; Pred. No. 1.6e-51; 

Matches 327; Conservative 11; Mismatches 20; Indels 17; Gaps 6; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll I I I II I I I I I I I I I I I II I I I I II I I I I I I : I I :: I I I I I I I I I I I II 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I I I I I I I I I : I I : I I I I I I I I I I I 1 I I I I I I I I I I I : I I : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 225

II III I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 1 II I 

Db 179 RGSSGSVVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 238

Qy 226 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVD 285 

I I I I I I I I I I I I I It I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I ! 
Db 239 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVD 298 

Qy 286 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAM 345 

I I I I I I I I I I I I I I I 1 I I I I I I M I I I I I I I I 1 : I I I I I I I 1 I I I I I I I I I I I : I I I I I 
Db 299 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAM 358 

Qy 346 AKIQAKIPGLKRKAD 360 

I I I I I I I I II I II I : 
Db 359 AKIQAKIPGLKRKAE 373



RESULT 7 

US-09-893-348-24 

Sequence 24, Application US/09893348 
Patent No. US20020072493A1
GENERAL INFORMATION: 
APPLICANT: EISENBACH-SCHWARTZ, Michal
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 



FILE REFERENCE: EIS-SCHWARTZ=2A
; CURRENT APPLICATION NUMBER: US/09/893,348
; CURRENT FILING DATE: 2001-06-28 
; PRIOR APPLICATION NUMBER: US 09/314,161 
; PRIOR FILING DATE: 1999-05-19 
; PRIOR APPLICATION NUMBER: US 09/218,277 
; PRIOR FILING DATE: 1998-12-22 
; PRIOR APPLICATION NUMBER: PCT/US98/14715
; PRIOR FILING DATE: 1998-07-21 
; PRIOR APPLICATION NUMBER: IL 124500 
; PRIOR FILING DATE: 1998-05-19 
; NUMBER OF SEQ ID NOS : 29 

SOFTWARE: PatentIn version 3.1
; SEQ ID NO 24 
LENGTH: 373
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-893-348-24 

Query Match 87.9%; Score 1603.3; DB 9; Length 373; 

Best Local Similarity 87.2%; Pred. No. 1.6e-51; 

Matches 327; Conservative 11; Mismatches 20; Indels 17; Gaps 6; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I : I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I! I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III llhll : I I I I I I I I I I I I I I I I I I I I I I I : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 225

II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I II I I I I 

Db 179 RGSSGSVVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 238

Qy 226 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVD 285 

I I I I I II I I I I I I I I I I I I I II I I I I I I I I II I I I I I I II I I I I II I I I I I I II I M I I 

Db 239 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVD 298 

Qy 286 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAM 345

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I 

Db 299 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAM 358 

Qy 346 AKIQAKIPGLKRKAD 360

I I I I I I I I I II I I I : 
Db 359 AKIQAKIPGLKRKAE 373 
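
Each alignment row carries its own start and end coordinates, which gives a simple way to spot rows in which scanning has dropped or merged residues. The sketch below is a hypothetical helper, not part of GenCore: it assumes the row layout seen in this listing (label, start, sequence, end) and that every letter in the sequence field is one residue while `-` marks a gap.

```python
def row_is_consistent(line):
    """True if the residue count of a 'Qy'/'Db' row matches its coordinates.

    Stray spaces inside the sequence field are tolerated: everything between
    the start and end numbers is treated as sequence text.
    """
    parts = line.split()
    start, end = int(parts[1]), int(parts[-1])
    sequence = "".join(parts[2:-1])
    residues = sum(1 for ch in sequence if ch.isalpha())  # gaps ('-') excluded
    return residues == end - start + 1


# A row with two gap characters: 58 residues for coordinates 1..58.
print(row_is_consistent(
    "Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58"))

# A row whose scan introduced a space but lost no residues.
print(row_is_consistent("Qy 346 AKIQAKI PGLKRKAD 360"))

# A row in which the scan merged letters: 58 residues for coordinates 187..246.
print(row_is_consistent(
    "Qy 187 WFGASLFLLLSLTVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 246"))
```

The first two calls print `True` and the third prints `False`. A failed check only says that a row is damaged; which letters were lost still has to be worked out from the neighbouring rows.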



RESULT 8 
US-10-408-967-8 

; Sequence 8, Application US/10408967 
; Publication No. US20040063161A1 
; GENERAL INFORMATION: 



; APPLICANT: Pharmacia & Upjohn Company 
; APPLICANT: Yan, Riqiang 
; APPLICANT: Lu, Yifeng 

; TITLE OF INVENTION: Compositions and Methods of Treating Alzheimer's Disease 
; FILE REFERENCE: 00925 

; CURRENT APPLICATION NUMBER: US/10/408,967

; CURRENT FILING DATE: 2003-04-08 

; NUMBER OF SEQ ID NOS : 9 

; SOFTWARE: PatentIn version 3.1

; SEQ ID NO 8 

LENGTH: 373

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-408-967-8 

Query Match 87.9%; Score 1603.3; DB 12; Length 373; 

Best Local Similarity 87.2%; Pred. No. 1.6e-51; 

Matches 327; Conservative 11; Mismatches 20; Indels 17; Gaps 6;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 | | | | 1 | I | I I I I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I I I I 1 I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III I I I : I I : I I II I I I I I I I I I I I I I I I I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

I I I I I I I I II I I I II II I I I I I I I I I I II II I I I I I M I I II I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 225

II III I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I II I I I I M I I I H I I I 

Db 179 RGSSGSVVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 238

Qy 226 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVD 285 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 239 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVD 298 

Qy 286 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAM 345

I I I I I I I I I I I II I I II I II I I I I I I I I M I I I : I t I I I I I I I I I I I I I I I I I : I I I I I 
Db 299 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAM 358

Qy 346 AKIQAKIPGLKRKAD 360

I I I I I I I I I I I I I I : 
Db 359 AKIQAKIPGLKRKAE 373



RESULT 9 

US-10-060-036-72 

; Sequence 72, Application US/10060036 

; Publication No. US20030073144A1 

; GENERAL INFORMATION: 

; APPLICANT: Benson, Darin R. 

; APPLICANT: Kalos, Michael D. 

; APPLICANT: Lodes, Michael J. 

; APPLICANT: Persing, David H. 



; APPLICANT: Hepler, William T.
; APPLICANT: Jiang, Yuqiu 

; TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
; TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
; FILE REFERENCE: 210121.566 

; CURRENT APPLICATION NUMBER: US/10/060,036
; CURRENT FILING DATE: 2002-01-30 
; NUMBER OF SEQ ID NOS : 4560 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 72 

LENGTH: 373 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-060-036-72 

Query Match 87.9%; Score 1603.3; DB 14; Length 373; 

Best Local Similarity 87.2%; Pred. No. 1.6e-51; 

Matches 327; Conservative 11; Mismatches 20; Indels 17; Gaps 6;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll I I I I I I I I I I I I I I I I I I I I I 1 I II I I I I I : I I : : I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I 1 II III I I I : I I : I I I I I I I I I I II I I I I I I I II I I : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I II I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 225

II III II I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 179 RGSSGSVVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 238

Qy 226 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVD 285 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 239 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVD 298 

Qy 286 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAM 345 

I I I I I I II I I II I I I II I I I I I I I I I I I I I II I : I I I I I I I I I I II I I I I I I I : I II I I 
Db 299 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAM 358 

Qy 346 AKIQAKIPGLKRKAD 360

I I I I I I I II II I I I : 
Db 359 AKIQAKIPGLKRKAE 373



RESULT 10 
US-10-466-258-4 

; Sequence 4, Application US/10466258 

; Publication No. US20040132096A1 

; GENERAL INFORMATION: 

; APPLICANT: GLAXO GROUP LIMITED 

; TITLE OF INVENTION: ASSAY 

; FILE REFERENCE: P80966 GCW 

; CURRENT APPLICATION NUMBER: US/10/466,258



; CURRENT FILING DATE: 2003-07-15 
; NUMBER OF SEQ ID NOS : 13 
; SOFTWARE: PatentIn version 3.0
; SEQ ID NO 4 

LENGTH: 373 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-466-258-4 

Query Match 87.5%; Score 1595.3; DB 16; Length 373; 

Best Local Similarity 86.7%; Pred. No. 3.3e-51; 

Matches 325; Conservative 12; Mismatches 21; Indels 17; Gaps 6 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 Mill I I I I I I I II I I I II I I I I I I I I I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III 111:11 : I I I I I I I I I I I I I II I I I I I I I : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPCWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I II : I I I I I I I I I I I I M I I I I I I I I I I I I I II II 

Db 119 APSPLSAAAVSPSKLPQDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RG-SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 225

II III I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I 

Db 179 RGSSGSVVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIY 238

Qy 226 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVD 285

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I 
Db 239 KGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVD 298 

Qy 286 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAM 345

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I 
Db 299 SLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAM 358 

Qy 346 AKIQAKIPGLKRKAD 360

I I I I I I I I I I II I I : 
Db 359 AKIQAKIPGLKRKAE 373



RESULT 11 
US-09-789-386-2 

; Sequence 2, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE: GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/09/789,386

; CURRENT FILING DATE: 2001-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

; PRIOR FILING DATE: 1998-07-22 



PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 2 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-2 

Query Match 83.3%; Score 1518.4; DB 9; Length 1192; 

Best Local Similarity 27.3%; Pred. No. 2.1e-47; 

Matches 326; Conservative 12; Mismatches 20; Indels 836; Gaps 6;

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

llhlll I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II : I I :: I I I I II I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I II III llhll : I I I I I I I I I I I I I I I I I I I I I I I : II : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RGSG 170 

I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSL 238 

Qy 171 170 

Db 239 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 298

Qy 171 170 

Db 299 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDE 358 

Qy 171 170 

Db 359 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLES 418

Qy 171 170 

Db 419 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATN 478 

Qy 171 170 

Db 479 IFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKV 538 

Qy 171 170 

Db 539 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 598 

Qy 171 170 

Db 599 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPP 658 



Qy 171 170 

Db 659 YEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 718 

Qy 171 170 

Db 719 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 778 

Qy 171 170 

Db 779 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 838

Qy 171 170 

Db 839 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 898 

Qy 171 170 

Db 899 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 958

Qy 171 SSVVDLLYWRDIKKTG 186

: I I I I I I I I I I I I I I I 

Db 959 ATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTG 1018

Qy 187 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 246

I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 1019 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1078

Qy 247 SEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 306

I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 1079 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1138

Qy 307 TLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360 

I I I I I I I I II I I : I I I I I I I I I I I I I I I I I I I : I I II I I I I I I I I I I I I I I I : 
Db 1139 TLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 12 
US-09-758-140-6 

; Sequence 6, Application US/09758140 
; Patent No. US20020012965A1
; GENERAL INFORMATION: 

; APPLICANT: Strittmatter, Stephen M.

; TITLE OF INVENTION: Nogo Receptor-Mediated Blockade of
Axonal Growth

FILE REFERENCE: 44574-5073-US
; CURRENT APPLICATION NUMBER: US/09/758,140 
; CURRENT FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: US 60/175,707 
; PRIOR FILING DATE: 2000-01-12 
; PRIOR APPLICATION NUMBER: US 60/207,366 
; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: US 60/236,378 
; PRIOR FILING DATE: 2000-09-29 
; NUMBER OF SEQ ID NOS : 20 
; SOFTWARE: PatentIn Ver. 2.1



; SEQ ID NO 6 

; LENGTH: 1192 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-758-140-6 

Query Match 83.3%; Score 1518.4; DB 9; Length 1192; 

Best Local Similarity 27.3%; Pred. No. 2.1e-47; 

Matches 326; Conservative 12; Mismatches 20; Indels 836; Gaps 6; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I II I I I I II III I II : II : I I I I I I I II I I I I I I I I I I I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RGSG 170 

I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSL 238

Qy 171 170 

Db 239 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 298 

Qy 171 170 

Db 299 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDE 358

Qy 171 170 

Db 359 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLES 418 

Qy 171 170 

Db 419 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 478

Qy 171 170 

Db 479 IFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKV 538 

Qy 171 170 

Db 539 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 598

Qy 171 170 

Db 599 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPP 658 

Qy 171 170 

Db 659 YEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 718 



Qy 171 170



Db 719 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 778 
Qy 171 170 

Db 779 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 838 
Qy 171 170

Db 839 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 898 
Qy 171 170 

Db 899 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 958 

Qy 171 SSVVDLLYWRDIKKTG 186

: I I I I I I i I I I I I I I I 

Db 959 ATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTG 1018

Qy 187 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 246

I I I I I f I I M I I I I I I I II I I I I I I I I ! I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I 
Db 1019 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1078

Qy 247 SEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 306

I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1079 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1138

Qy 307 TLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360

I I I I I I I I II I I : I I I I I II I I I I I I I I I I II : I I I I II I I I I I I I I I I I I I : 
Db 1139 TLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192

RESULT 13 
US-09-893-348-23 

Sequence 23, Application US/09893348 
Patent No. US20020072493A1
GENERAL INFORMATION: 
APPLICANT: EISENBACH-SCHWARTZ, Michal
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A
CURRENT APPLICATION NUMBER: US/09/893,348
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 



; NUMBER OF SEQ ID NOS : 29 
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 23 
; LENGTH: 1192 
TYPE: PRT

ORGANISM: Homo sapiens 
US-09-893-348-23 

Query Match 83.3%; Score 1518.4; DB 9; Length 1192; 

Best Local Similarity 27.3%; Pred. No. 2.1e-47; 

Matches 326; Conservative 12; Mismatches 20; Indels 836; Gaps 6; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I : I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115

I I I I I I I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I II I I I : I I : I 

Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166

II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

Qy 167 RGSG 170 

I I I 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSL 238

Qy 171 170

Db 239 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 298

Qy 171 170 

Db 299 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDE 358

Qy 171 170

Db 359 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLES 418 

Qy 171 170 

Db 419 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATN 478 

Qy 171 170 

Db 479 IFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKV 538 

Qy 171 170 

Db 539 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 598 

Qy 171 170 

Db 599 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPP 658 

Qy 171 170 



Db 659 YEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 718

Qy 171 170

Db 719 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 778

Qy 171 170

Db 779 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 838

Qy 171 170

Db 839 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 898

Qy 171 170

Db 899 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 958

Qy 171 SSVVDLLYWRDIKKTG 186

:|||||||||||||||
Db 959 ATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTG 1018

Qy 187 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 246

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1019 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1078

Qy 247 SEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 306

|||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db 1079 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1138

Qy 307 TLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360

||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db 1139 TLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192





RESULT 14 
US-09-972-599A-6 

; Sequence 6, Application US/09972599A 

; Patent No. US20020077295A1

; GENERAL INFORMATION: 

; APPLICANT: STRITTMATTER, STEPHEN M. 

; TITLE OF INVENTION: NOGO-RECEPTOR-MEDIATED BLOCKADE OF AXONAL GROWTH 
; FILE REFERENCE: C077 CIP US 

; CURRENT APPLICATION NUMBER: US/09/972,599A
; CURRENT FILING DATE: 2001-10-06 

PRIOR APPLICATION NUMBER: PCT/US01/01041
; PRIOR FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: 09/758,140 
; PRIOR FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: 60/236,378 
; PRIOR FILING DATE: 2000-09-29 
; PRIOR APPLICATION NUMBER: 60/207,366 
; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: 60/175,707 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 57 



SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 6 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-972-599A-6 

Query Match 83.3%; Score 1518.4; DB 9; Length 1192; 

Best Local Similarity 27.3%; Pred. No. 2.1e-47; 

Matches 326; Conservative 12; Mismatches 20; Indels 836; Gaps 6; 

Qy      1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
          |||:||| ||||| |||||| |||||||| |||||| |||||:||::|||||||||||||
Db      1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy     61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
          |||||||| ||  ||| |||:|| :| |||||||||||||| ||||||||: || :   |
Db     59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy    116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166
          |||   |||| ||||||||||||||||||||  || ||         |||||||||||||
Db    119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy    167 RGSG-------------------------------------------------------- 170
          |||
Db    179 RGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSL 238

Qy    171 ------------------------------------------------------------ 170

Db    239 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 298

Qy    171 ------------------------------------------------------------ 170

Db    299 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDE 358

Qy    171 ------------------------------------------------------------ 170

Db    359 VVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLES 418

Qy    171 ------------------------------------------------------------ 170

Db    419 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 478

Qy    171 ------------------------------------------------------------ 170

Db    479 IFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKV 538

Qy    171 ------------------------------------------------------------ 170

Db    539 TEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 598

Qy    171 ------------------------------------------------------------ 170

Db    599 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPP 658

Qy    171 ------------------------------------------------------------ 170

Db    659 YEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 718

Qy    171 ------------------------------------------------------------ 170

Db    719 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 778

Qy    171 ------------------------------------------------------------ 170

Db    779 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 838

Qy    171 ------------------------------------------------------------ 170

Db    839 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEEPTLISSKTDSFSKLAREYTDLEVSHKS 898

Qy    171 ------------------------------------------------------------ 170

Db    899 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 958

Qy    171 --------------------------------------------SSVVDLLYWRDIKKTG 186
                                                      :|||||||||||||||
Db    959 ATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTG 1018

Qy    187 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 246
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1019 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1078

Qy    247 SEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 306
          |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db   1079 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1138

Qy    307 TLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
          ||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db   1139 TLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 15 
US-10-267-502-429 

; Sequence 429, Application US/10267502
; Publication No. US20040071700A1
; GENERAL INFORMATION:
; APPLICANT: Kim, Jaeseob
; APPLICANT: Galant, Ron
; TITLE OF INVENTION: Obesity Linked Genes
; FILE REFERENCE: LSD-07416
; CURRENT APPLICATION NUMBER: US/10/267,502
; CURRENT FILING DATE: 2003-01-27
; NUMBER OF SEQ ID NOS: 439
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 429
; LENGTH: 1192
; TYPE: PRT
; ORGANISM: Homo sapiens
US-10-267-502-429 

Query Match 83.3%; Score 1518.4; DB 12; Length 1192; 

Best Local Similarity 27.3%; Pred. No. 2.1e-47; 

Matches 326; Conservative 12; Mismatches 20; Indels 836; Gaps 6; 



Qy      1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
          |||:||| ||||| |||||| |||||||| |||||| |||||:||::|||||||||||||
Db      1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy     61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
          |||||||| ||  ||| |||:|| :| |||||||||||||| ||||||||: || :   |
Db     59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy    116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166
          |||   |||| ||||||||||||||||||||  || ||         |||||||||||||
Db    119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy    167 RGSG-------------------------------------------------------- 170
          |||
Db    179 RGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSL 238

Qy    171 ------------------------------------------------------------ 170

Db    239 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 298

Qy    171 ------------------------------------------------------------ 170

Db    299 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDE 358

Qy    171 ------------------------------------------------------------ 170

Db    359 VVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLES 418

Qy    171 ------------------------------------------------------------ 170

Db    419 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATN 478

Qy    171 ------------------------------------------------------------ 170

Db    479 IFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKV 538

Qy    171 ------------------------------------------------------------ 170

Db    539 TEEVVANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 598

Qy    171 ------------------------------------------------------------ 170

Db    599 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPP 658

Qy    171 ------------------------------------------------------------ 170

Db    659 YEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 718

Qy    171 ------------------------------------------------------------ 170

Db    719 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 778

Qy    171 ------------------------------------------------------------ 170

Db    779 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 838

Qy    171 ------------------------------------------------------------ 170

Db    839 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 898

Qy    171 ------------------------------------------------------------ 170

Db    899 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 958

Qy    171 --------------------------------------------SSVVDLLYWRDIKKTG 186
                                                      :|||||||||||||||
Db    959 ATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTG 1018

Qy    187 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 246
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1019 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1078

Qy    247 SEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 306
          |||||||||||||||||||||| |||||||||||||||||||||||||||||||||||||
Db   1079 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1138

Qy    307 TLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
          ||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db   1139 TLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



Search completed: September 29, 2004, 19:11:29 
Job time : 96.8807 secs



GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM protein - protein search, using sw model

Run on:          September 29, 2004, 18:43:17 ; Search time 26.4632 Seconds
                 (without alignments)
                 1308.568 Million cell updates/sec

Title:           SEQ2_1-171_975-1163
Perfect score:   1823
Sequence:        1 MEDIDQSSLVSSSTDSPPRP...VKDAMAKIQAKIPGLKRKAD 360

Scoring table:   BLOSUM62
                 Gapop 10.0 , Gapext 0.1

Searched:        283366 seqs, 96191526 residues

Total number of hits satisfying chosen parameters: 283366

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       PIR It:*
                 pir1:*
                 pir2:*
                 pir3:*
                 pir4:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



SUMMARIES

Result           Query
   No.   Score   Match  Length  DB  ID       Description

     1   826.3    45.3     776   2  A46583   neuroendocrine-spe
     2   683      37.5     208   2  I60904   neuroendocrine-spe
     3   669      36.7     267   2  A60021   tropomyosin-relate
     4   437.7    24.0    2484   2  T26216   hypothetical prote
     5   427.4    23.4    2607   2  T26215   hypothetical prote
     6   337.5    18.5     222   2  T26213   hypothetical prote
     7   272.1    14.9     786   2  A35466   progesterone recep
     8   266.9    14.6    1173   2  T31421   C-terminal domain-
     9   265.5    14.6    1611   2  T38236   hypothetical prote
    10   258.5    14.2    1058   2  T13286   cappuccino gene pr
    11   257.5    14.1    1206   2  S24407   formin isoform IV
    12   256.4    14.1    3530   2  A59266   unconventional myo
    13   254.9    14.0    1468   2  S11515   formin - mouse
    14   254.1    14.0     716   2  T26998   hypothetical prote
    15   253.8    13.9     760   2  F86387   probable Pto kinas
    16   253.2    13.9     929   2  C96623   hypothetical prote
    17   253.2    13.9    1255   2  T31065   diaphanous protein
    18   252.2    13.8    2157   2  S71461   proline-rich prote
    19   251.9    13.8    1201   2  G86441   unknown protein [i
    20   250.9    13.8    1015   2  JC6552   DNA topoisomerase
    21   250.2    13.7     907   2  E96636   hypothetical prote
    22   246.4    13.5    1657   2  T19536   hypothetical prote
    23   246      13.5    1522   2  T39371   transcription regu
    24   245.8    13.5     645   2  A71416   hypothetical prote
    25   245.4    13.5    1460   1  EDBEIF   immediate-early pr
    26   245.2    13.5    3511   2  A59295   unconventional myo
    27   245.1    13.4     710   2  D96728   hypothetical prote
    28   244.7    13.4     465   2  G02738   FREAC-4 - human
    29   244.3    13.4    1064   2  T13963   formin related pro
    30   242.7    13.3    1533   2  T00344   hypothetical prote
    31   242.1    13.3    1375   2  S48375   hypothetical prote
    32   241.9    13.3    1213   2  A58198   serine/proline-ric
    33   241.7    13.3    3938   2  T42761   Bassoon protein -
    34   241.1    13.2     775   1  EDBE11   immediate-early pr
    35   240.9    13.2     980   2  G75523   probable cell divi
    36   240.6    13.2    1634   2  T26517   hypothetical prote
    37   240.5    13.2    1446   1  A45344   immediate-early pr
    38   238.9    13.1     731   2  T04455   hypothetical prote
    39   238.5    13.1    1047   2  A55617   masquerade precurs
    40   238.5    13.1    1420   2  T37781   nrobflbl rvtrj^kpl p
    41   238.4    13.1    7962   2  I38346   elastic titin - hu
    42   238      13.1    3149   1  QQBE8    BPLF1 protein - hu
    43   237.6    13.0     933   1  QRHUP    progesterone recep
    44   237.5    13.0     980   2  S54986   regulatory protein
    45   236.7    13.0    1870   2  S37671   MHC class III hist



ALIGNMENTS 



RESULT 1 
A46583 

neuroendocrine-specific protein, splice form A - human
N;Contains: neuroendocrine-specific protein, splice form B
C;Species: Homo sapiens (man)
C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999
C;Accession: A46583; I60903
R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers,
F.C.; Van de Ven, W.J.
J. Biol. Chem. 268, 13439-13447, 1993
A;Title: Cloning and expression of alternative transcripts of a novel
neuroendocrine-specific gene and identification of its 135-kDa translational
product.
A;Reference number: A46583; MUID:93293865; PMID:7685762
A;Accession: A46583
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-776 <ROE1>
A;Cross-references: GB:L10333; NID:g307306; PIDN:AAA59950.1; PID:g307307
A;Accession: I60903
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 421-776 <ROE2>
A;Cross-references: GB:L10334; NID:g307308; PIDN:AAA59951.1; PID:g307309
C;Genetics:
A;Gene: GDB:RTN1; NSP
A;Cross-references: GDB:203968; OMIM:600865
A;Map position: 14q21-14q22

Query Match 45.3%; Score 826.3; DB 2; Length 776; 

Best Local Similarity 27.7%; Pred. No. 3.9e-20; 

Matches 195; Conservative 52; Mismatches 91; Indels 367; Gaps 17 

Qy 10 VSSSTDS PPR 19 

I I I : I ||: 

Db 85 VSSAMDHTFSTTSKDGEGSCYTSLISDICYPPQEDSTYFTGILQKENGHVTISESPEELG 144 

Qy 2 0 -PPP AFKYQFVTEPED 34 

II I : I I : I I I : 

Db 145 TPGPSLPDVPGIESRGLFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEE 204 

Qy 35 EEDEEEEEDEEED DED 50 

: : I : III II 
Db 205 VKHQEQHHPELEDKDLDFKNKDTDISIKPEGVREPDKPAPVEGKIIKDHLLEESTFAPYI 264 

Qy 51 LEELE 55 

I hi 

Db 265 DDLSEEQRRAPQITTPVKITLTEIEPSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPE 324

Qy 56 VLERKPAAGLS 66 

: I I I I I 

Db 325 DDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQLADRP 384 

Qy 67 AAAVPPAAAAPLLDFSSDSV PPA 89 

I : I I : I I I : I I I : 

Db 385 EVKARSGPPTIPSP-LDHEASSAESGDSEIELVSEDPMAAEDALPSGYVSFGHVGGPPPS 443 

Qy 90 PRGPL PAAPPA 100 

II III 

Db 444 PASPSIQYSILREEREAELDSELIIESCDASSASEESPKREQDSPPMKPSALDAIREETG 503 

Qy 101 — APERQPS WERSPAAPAPSLPPA AAVLPSKLPEDD E 135 

I I I I I : : I I 1 I I : I I I I I : I : 

Db 504 VRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGALEPETPMLPRK-PEEDSSSNQ 562 

Qy 136 PPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSVVDLLYWRDIKKTGVVFGASLFL 195

II III I I I I I : : : I I I I I I I I I : II : I II : I I 

Db 563 SPAATKGPGPLG PGAPPPLLFLNKQK AI DLLYWRDI KQTGI VFGS FLLL 611 

Qy 196 LLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEEL 255

I I II Ihlll Ihlll II II I I I I I I I : I I : I I : I I I I I I : I I I I I : : I : I 
Db 612 LFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQ 671

Qy 256 VQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALIS 315 

: I I I : : : I I I I : I I I I I I I I I I I I I I I I I I I I I I : I I I I I I II I I I II : : I : : I 

Db 672 IQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALFNGLTLLLMAVVS 731



Qy 316 LFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360

: I : : I I : I : I I III I I I I : : li I I I I I I I II I : 

Db 732 MFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQAKIPGAKRHAE 776



RESULT 2 
I60904

neuroendocrine-specific protein C - human
C;Species: Homo sapiens (man)
C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999
C;Accession: I60904
R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers,
F.C.; Van de Ven, W.J.
J. Biol. Chem. 268, 13439-13447, 1993
A;Title: Cloning and expression of alternative transcripts of a novel
neuroendocrine-specific gene and identification of its 135-kDa translational
product.
A;Reference number: A46583; MUID:93293865; PMID:7685762
A;Accession: I60904
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-208 <RES>
A;Cross-references: GB:L10335; NID:g307310; PIDN:AAA59952.1; PID:g307311
C;Genetics:
A;Gene: GDB:RTN1; NSP
A;Cross-references: GDB:203968; OMIM:600865
A;Map position: 14q21-14q22

Query Match 37.5%; Score 683; DB 2; Length 208; 

Best Local Similarity 67.4%; Pred. No. 1.1e-16;

Matches 128; Conservative 31; Mismatches 31; Indels 0; Gaps 0; 

Qy 171 SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 230

I : | | I I | I I II : II : I I I : I II III I I : I I I I I : I I I II I I I I I I I I I : I 
Db 19 SQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQ 78

Qy 231 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFA 290 

I : I I : I I I I I I : I I I I I : : I : I : I I I : : : I I I I : I I I I I I I I I I I I I II I I I 
Db 79 AVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFA 138 

Qy 291 VLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQA 350

MM: I I I I I I I I f I I I [ : : I : : I : I : : | I : I : I I Ml I I I I : : I I I II 

Db 139 VLMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYVKHQAQIDQYLGLVRTHINAVVAKIQA 198

Qy 351 KIPGLKRKAD 360 

I I II II I : 
Db 199 KIPGAKRHAE 208 



RESULT 3 
A60021 

tropomyosin-related protein, neuronal - rat
C;Species: Rattus norvegicus (Norway rat)
C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999
C;Accession: A60021
R;Wieczorek, D.F.; Hughes, S.R.
Brain Res. Mol. Brain Res. 10, 33-41, 1991
A;Title: Developmentally regulated cDNA expressed exclusively in neural tissue.
A;Reference number: A60021; MUID:91278684; PMID:1647480
A;Accession: A60021
A;Molecule type: mRNA
A;Residues: 1-267 <WIE>
A;Cross-references: EMBL:X52817; NID:g456549; PIDN:CAA37001.1; PID:g456550
C;Comment: This neuronal-specific mRNA was identified by hybridization to an
alpha-tropomyosin probe but does not show homology in amino acid sequence.

Query Match 36.7%; Score 669; DB 2; Length 267; 

Best Local Similarity 66.7%; Pred. No. 6.4e-16; 

Matches 124; Conservative 32; Mismatches 30; Indels 0; Gaps 0; 

Qy 171 SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 230

I : I I I I I I I I I : I I : I II : I II III I I : I I I I I : I I I I I I I I I I I I | I : I 
Db 10 SQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQ 69

Qy 231 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFA 290

I : I I : I I I I I I : I I I 1 I : : I : I : I I I : : : I I I I : I I I I I I I I I I I I I I I I I I 

Db 70 AVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKFA 129

Qy 291 VLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQA 350

I I M : I I I I I I I I I I I I I : : I : : I : I : : I I : I : I I I : I II I I : MINI 

Db 130 VLMWLLTYVGALFNGLTLLLMAVVSMFTLPVVYVKHQAQVDQYLGL 189

Qy 351 KIPGLK 356 

I I I I : 

Db 190 KIPGAR 195 



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T26216 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A; Accession: T26216 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-2484 <WIL> 

A;Cross-references: EMBL:Z78066; PIDN:CAB51467.1; GSPDB:GN00023; CESP:W06A7.3c
A; Experimental source: clone W06A7 
C; Genetics : 

A;Gene: CESP:W06A7.3c
A;Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2 

Query Match 24.0%; Score 437.7; DB 2; Length 2484; 

Best Local Similarity 14.8%; Pred. No. 1.2e-05; 

Matches 135; Conservative 77; Mismatches 138; Indels 563; Gaps 21; 

Qy 1 MEDIDQSSL VSSST 14 

I I : : I : I I I I 

Db 1566 MEWTESEISEMAPQVSESTCPIPEPLADLKLPVEDDEKTPEPEPWPGQVQERIIPIEV 1625 



Qy 15 DSPPRPPPAFKYQF — VTEPEDE 35 

I I II I I : I : I I : 
Db 1626 EQAPTIPQRPPRAPKSELPKVAKPLDDSKSRVRFAPLNIKLGRTYSEEQQKELVESLERP 1685 

Qy 36 ED EEEEEDE — 44 

N |::M I 

Db 1686 LTIITQQKPPEKPTEDIGALSPLSPNTLAEYEEVPMMDMQSVPHSPQEKQEEIEALSEII 1745 

Qy 45 EEDDEDLEELEVL 57 

I : I : I I I I : : 

Db 1746 EEPQAMKEVEKPVESAPEKDNESLEAPEIINEPIRRVLVETKIMGPGKSLNEDNDDDDDG 1805 
Qy 58 57 

Db 1806 SECLDSIGDLSERTIQRFNTSIDDPSIRRDSFSSISSFGDRQKFRTAI ENIRQDLLPFQS 1865 

Qy 58 ERKPAA 63 

I : I : I 

Db 1866 SVSQYLRSSPNPSQQLLVTNLSMDSPSDLSPNAPPVGFENTAQFLEKLQQEDRPSAEGSI 1925

Qy 64 GLSAAAVPP 72 

II III 

Db 1926 DSSGFEKVDHEGLDEFAAPPVHDPMQKSVFGSLGSDDMKPGSQDDGFVFIERNEANEATL 1985 

Qy 73 AAAAPLL 79 

I III 

Db 1986 KKNQKMSSHHNDVIEKNYFNDNAPTAALLESPIAEEARKLVQDAVESASEYKKQAVDSGD 2045 

Qy 80 DFSSDSVPPA PRGPL 94 

I I : : I I I II: 
Db 2046 EIGRELLDNVEQKIEQVKEPIVDSLHKAYDGVGDFVHETVPNAVDDFVREAEKQLPESPV 2105 

Qy 95 P AAPP AAP ERQ 105 

I II II I : 

Db 2106 PEKIETPEPLVDIHDTVDKVHDEVDNFLRREPTPPFETDDVAPLSDDKPQFGNQTPEEDE 2165 

Qy 106 PSWER SPAAPAPSLPPAAAVLPSKLPEDD 134 

: : : I : I I : : I I I I I I : : 

Db 2166 TTFDRKGPLTIPEEVEKAAAAQNNDLDDFDPLVTSNTGAAFGAAVGAAAAV — ESLTEEE 2223 

Qy 135 EPPARPPPPPPAGASPLAEPAAPPST— PAAP KRRGSG- 17 0 

::: Ml |::| I I 

Db 2224 MFGHQKFETVPRPPTPP KDI SDEDVKPSTVNLGPSHHHSHPSSPHHSILKHHGDAW 2279 

Qy 171 SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFR 223

I : I : : I I I I I I : : I : I : I I : : : : | | | | | | 

Db 2280 IDFKTVPPCVLDVIYWRDAKKSAIVLSLALLVLFVLAKYPLLTVVTYSLLLALGAAAGFR 2339

Qy 224 IYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDL 283 

: : I I I : I : I I I I I : : : : | | : : : | : | : : | | ; 

Db 2340 VFKKVEAQIKKTDSEHPFSEILAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESP 2399 

Qy 284 VDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKD 343 

::|:M :::| I I : : I : I I I I I I : : | | : | : | | : | || :| : :|: 
Db 2400 LESIKFGLVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGHLKN 2459 



Qy 344 AMAKIQAKIPGLK 356

I I : I I : 
Db 2460 VQNIIDEKLPFLR 2472 



RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T26215 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A;Reference number: Z20173
A; Accession: T26215 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-2607 <WIL> 

A;Cross-references: EMBL:Z78066; PIDN:CAB01522.2; GSPDB:GN00023; CESP:W06A7.3a

A; Experimental source: clone W06A7 

C; Genetics : 

A;Gene: CESP:W06A7.3a

A;Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2; 
2586/2 

Query Match 23.4%; Score 427.4; DB 2; Length 2607; 

Best Local Similarity 13.0%; Pred. No. 3.1e-05; 

Matches 135; Conservative 78; Mismatches 137; Indels 686; Gaps 21; 

Qy 1 MEDIDQSSL VSSST 14 

I I : : I : I I I I 

Db 1566 MEWTESEISEMAPQVSESTCPIPEPLADLKLPVEDDEKTPEPEPWPGQVQERIIPIEV 1625 

Qy 15 DSPPRPPPAFKYQF — VTEPEDE 35 

I I II I I : I : I I : 
Db 1626 EQAPTIPQRPPRAPKSELPKVAKPLDDSKSRVRFAPLNIKLGRTYSEEQQKELVESLERP 1685

Qy 36 ED EEEEEDE 44 

M I::|| I 

Db 1686 LTIITQQKPPEKPTEDIGALSPLSPNTLAEYEEVPMMDMQSVPHSPQEKQEEIEALSEII 1745 

Qy 45 EEDDEDLEELEVL 57 

I : I : I I I I : : 

Db 1746 EEPQAMKEVEKPVESAPEKDNESLEAPEIINEPIRRVLVETKIMGPGKSLNEDNDDDDDG 1805 

Qy 58 57 

Db 1806 SECLDSIGDLSERTIQRFNTSIDDPSIRRDSFSSISSFGDRQKFRTAI ENIRQDLLPFQS 1865 

Qy 58 ERKPAA 63 

I : I : I 

Db 1866 SVSQYLRSSPNPSQQLLVTNLSMDSPSDLSPNAPPVGFENTAQFLEKLQQEDRPSAEGSI 1925 

Qy 64 GLSAAAVPP 72 

II III 

Db 1926 DSSGFEKVDHEGLDEFAAPPVHDPMQKSVFGSLGSDDMKPGSQDDGFVFIERNEANEATL 1985 



Qy 73 AAAAPLL 79

I III 

Db 1986 KKNQKMSSHHNDVIEKNYFNDNAPTAALLESPIAEEARKLVQDAVESASEYKKQAVDSGD 2045 

Qy 80 DFSSDSVPPA PRGPL 94 

II ::|| I ||: 
Db 2046 EIGRELLDNVEQKIEQVKEPIVDSLHKAYDGVGDFVHETVPNAVDDFVREAEKQLPESPV 2105 

Qy 95 P AAPP AAP ERQ 105 

I I I M | : 

Db 2106 PEKIETPEPLVDIHDTVDKVHDEVDNFLRREPTPPFETDDVAPLSDDKPQFGNQTPEEDE 2165 

Qy 106 PSWER SPAAPAPSLPPAAAVLPSKLPEDD 134 

: : : I : I I : : I I I I I I : : 

Db 2166 TTFDRKGPLTIPEEVEKAAAAQNNDLDDFDPLVTSNTGAAFGAAVGAAAAV — ESLTEEE 2223 

Qy 135 EPPARPPPPPPAGASPLAEPAAPPST PAAPKR 166 

I III II ::: Ml |::| 

Db 2224 MFGHQKFETVPRPPTPP KDI S DEDVKP STVNLGPSHHHSHPS S PHHS I LKHHGDAW 2279 



Qy 167 166 

Db 2280 IDFKTVPPCAQNAFSPGEIMFLLAFFVYLSCFASFFSKSLPLLDNLLSLWYLSISLIIH 2339 

Qy 167 166 

Db 2340 VKHHRKFRWNEEQATTMSKLGAVGRGLYALIAFIVNIVLRVGLNVALWGVAVSAHEAYK 2399

Qy 167 — RGSG SSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTI 220 

: I I I : I : : I I I I I I : : I : I : I I : : : : | | | | 

Db 2400 LTKSSGVLRKKEVLDVIYWRDAKKSAIVLSLALLVLFVLAKYPLLTVVTYSLLLALGAAA 2459 



Qy 221 SFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLV 280

I 1 : : I I I : I : I II I I : : : : | | : : : | : | : : | | 

Db 2460 GFRVFKKVEAQIKKTDSEHPFSEILAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFV 2519

Qy 281 DDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKS 340

= : : I : I I : : : I I I : : I : I I I I I | : : | | : | : | | : | | . : | : 
Db 2520 ESPLESIKFGLVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGH 2579 



Qy 341 VKDAMAKIQAKIPGLK 356

: I : I I : I I : 

Db 2580 LKNVQNIIDEKLPFLR 2595 



RESULT 6 
T26213 

hypothetical protein W06A7.3b - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T26213 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A;Accession: T26213

A; Status: preliminary; translated from GB/EMBL/DDBJ 



A;Molecule type: DNA 
A; Residues: 1-222 <WIL> 

A;Cross-references: EMBL:Z78066; PIDN:CAB01523.1; GSPDB:GN00023; CESP:W06A7.3b

A; Experimental source: clone W06A7 

C; Genetics : 

A;Gene: CESP:W06A7.3b

A;Map position: 5 

A;Introns: 27/1; 77/2; 201/2 

Query Match 18.5%; Score 337.5; DB 2; Length 222; 

Best Local Similarity 25.7%; Pred. No. 8.1e-05; 

Matches 65; Conservative 49; Mismatches 84; Indels 55; Gaps 2 

Qy 104 RQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAA 163 

: I I : I II I 

Db 13 KQPTW VPATDFP 24 

Qy 164 PKRRGSGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFR 223

: : I : : I I I I I I : : I : I : I I : : : : I I I I | I 

Db 25 GKILDVIYWRDAKKSAIVLSLALLVLFVLAKYPLLTVVTYSLLLALGAAAGFR 77

Qy 224 IYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDL 283

: : I I I : I : I III I : : : : I I : : : I : I : : I I : 

Db 78 VFKKVEAQIKKTDSEHPFSEILAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESP 137 

Qy 284 VDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKD 343

: : I : I I : : : I I I : : I : I I I I I I : : I I : I : I I : I I I : I : : I : 
Db 138 LESIKFGLVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGHLKN 197 

Qy 344 AMAKIQAKIPGLK 356 

I I : I I : 
Db 198 VQNIIDEKLPFLR 210 



RESULT 7 
A35466 

progesterone receptor form B - chicken
N;Contains: progesterone receptor form A
C;Species: Gallus gallus (chicken)
C;Date: 18-Nov-1994 #sequence_revision 18-Nov-1994 #text_change 20-Aug-1999
C;Accession: A35466; S06284; A40903; A24661; A24312; A40911; A61552
R;Jeltsch, J.M.; Turcotte, B.; Garnier, J.M.; Lerouge, T.; Krozowski, Z.;
Gronemeyer, H.; Chambon, P.
J. Biol. Chem. 265, 3967-3974, 1990
A;Title: Characterization of multiple mRNAs originating from the chicken
progesterone receptor gene. Evidence for a specific transcript encoding form A.
A;Reference number: A35466; MUID:90154085; PMID:2303488
A;Accession: A35466
A;Status: translation not shown
A;Molecule type: DNA
A;Residues: 1-786 <JEL>
A;Cross-references: GB:M32732; GB:J05240; NID:g212554; PIDN:AAA49011.1;
PID:g212558
R;Gronemeyer, H.; Turcotte, B.; Quirin-Stricker, C.; Bocquel, M.T.; Meyer, M.E.;
Krozowski, Z.; Jeltsch, J.M.; Lerouge, T.; Garnier, J.M.; Chambon, P.
EMBO J. 6, 3985-3994, 1987
A;Title: The chicken progesterone receptor: sequence, expression and functional
analysis.
A;Reference number: S06284; MUID:88166640; PMID:3443098
A;Accession: S06284
A;Molecule type: DNA
A;Residues: 1-786 <GRO>
A;Cross-references: EMBL:Y00092; NID:g63744; PIDN:CAA68282.1; PID:g63745
R;Conneely, O.M.; Dobson, A.D.W.; Tsai, M.J.; Beattie, W.G.; Toft, D.O.;
Huckaby, C.S.; Zarucki, T.; Schrader, W.T.; O'Malley, B.W.
Mol. Endocrinol. 1, 517-525, 1987
A;Title: Sequence and expression of a functional chicken progesterone receptor.
A;Reference number: A40903; MUID:91042592; PMID:3153474
A;Accession: A40903
A;Status: preliminary
A;Molecule type: mRNA
A;Residues: 1-64,'E',65-786 <CON>
A;Cross-references: GB:M37518
R;Conneely, O.M.; Sullivan, W.P.; Toft, D.O.; Birnbaumer, M.; Cook, R.G.;
Maxwell, B.L.; Zarucki-Schulz, T.; Greene, G.L.; Schrader, W.T.; O'Malley, B.W.
Science 233, 767-770, 1986
A;Title: Molecular cloning of the chicken progesterone receptor.
A;Reference number: A24661; MUID:86289413; PMID:2426779
A;Accession: A24661
A;Molecule type: mRNA
A;Residues: 128-133,'E',135-147,'E',149-164 <CO2>
A;Note: the authors translated the codon CAG for residue 7 as Glu and CAG for
residue 21 as Glu
R;Jeltsch, J.M.; Krozowski, Z.; Quirin-Stricker, C.; Gronemeyer, H.; Simpson,
R.J.; Garnier, J.M.; Krust, A.; Jacob, F.; Chambon, P.
Proc. Natl. Acad. Sci. U.S.A. 83, 5424-5428, 1986
A;Title: Cloning of the chicken progesterone receptor.
A;Reference number: A24312; MUID:86287271; PMID:2426697
A;Accession: A24312
A;Molecule type: mRNA
A;Residues: 417-490 <JE2>
A;Cross-references: GB:M14280; NID:g212607; PIDN:AAA49039.1; PID:g212608
A;Note: amino acid and corresponding nucleotide sequences are also shown for
three smaller peptides
R;Birnbaumer, M.; Hinrichs-Rosello, M.V.; Cook, R.G.; Schrader, W.T.; O'Malley,
B.W.
Mol. Endocrinol. 1, 249-259, 1987
A;Title: Chemical and antigenic properties of pure 108,000 molecular weight
chick progesterone receptor.
A;Reference number: A40911; MUID:88288199; PMID:3453892
A;Accession: A40911
A;Status: preliminary
A;Molecule type: protein
A;Residues: 128-133,'E',135-147,'E',149-164;546-558 <BIR>
R;Simpson, R.J.; Grego, B.; Govindan, M.V.; Gronemeyer, H.
Mol. Cell. Endocrinol. 52, 177-184, 1987
A;Title: Peptide sequencing of the chick oviduct progesterone receptor form B.
A;Reference number: A61552; MUID:88005426; PMID:3653503
A;Accession: A61552
A;Molecule type: protein
A;Residues: 136-153;168-174;195-228;526-537,'X',539;546-563 <SIM>
C;Genetics:
A;Introns: 400/2; 451/1; 490/1; 591/1; 639/2; 683/1; 735/3
C;Superfamily: progesterone receptor; erbA transforming protein homology
C;Keywords: DNA binding; nucleus; steroid hormone receptor; zinc finger
F;1-786/Product: progesterone receptor form B #status predicted <MA1>
F;128-786/Product: progesterone receptor form A #status predicted <MA2>
F;419-682/Domain: erbA transforming protein homology <ERBA>
F;421-441/Region: zinc finger
F;457-481/Region: zinc finger



Query Match 14.9%; Score 272.1; DB 2; Length 786; 

Best Local Similarity 15.9%; Pred. No. 0.31; 

Matches 121; Conservative 54; Mismatches 119; Indels 469; Gaps 28; 

QY 11 SSSTDSPP RPPPA FKYQFVTEPEDEEDEEEEEDEEEDD 48 

I I : I : II: I I : I I : I I I I : I I I : 

Db 6 SKETRAPSSARDGAVLLQAPPSRGEAEGIDVALDGLLYPRSSDEEEEEEENEEEEEEEEP 65 

QY 4 9 EDLEELEVLER KPAAG LSAAAVP PAAAAPLLDFSSD — SVPP 8 8 

: Ml I : I I | I I I I I I : | I 

Db 66 QQREEEEEEEEEDRDCPSYRPGGGSLSKDCLDSVLDTFLAPAAHAAPWSLFGPEVPEVPV 125 

Qy 89 AP — RGP L P AAP PAAP ERQPSWERS PAAP AP 117 

I I I I I : I I I II I I I || 

Db 126 APMSRGPEQKAVDAGPGAPGPSQP RP GAP LW P GAD S LN VAVKAR P G P E DAS EN RAP 181 

QY 118 SLP PAAAVLPSKLPEDDEP 136 

M Mill:: M 
Db 182 GLPGAEERGFPERDAGPGEGGLAPAAAAS PAAV EPGAGQDYLHVP I LPLNSAFLAS 237 

Qy 137 136 

Db 238 RTRQLLDVEAAYDGSAFGPRSSPSVPAADLAEYGYPPPDGKEGPFAYGEFQSALKIKEEG 2 97 

Qy 137 PARPPP PPPAGASPLAE PAAP PST 160 

I I I I I I | | | | M| 

Db 298 VGLPAAPPPFLGAKAAPADFAQPPRAGQEPSLECVLYKAEPPLLPGAYGPPAAPDSLPST 357 

Qy 161 PAAP 164 

I I I 

Db 358 SAAPPGLYSPLGLNGHHQALGFPAAVLKEGLPQLCPPYLGYVRPDTETSQSSQYSFESLP 417 

Qy 165 KR- RGSGSSWDLL Y 17 8 

II I : II : 

Db 418 QKICLICGDEASGCHYGVLTCGSCKVFFKRAMEGQHNYLCAGRNDCIVDKIRRKNCPACR 477 

Qy 179 WRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEG 238

I : I : I I ||: : : : : | : | : 
Db 478 LRKCCQAGMVLGGRKFKKLN KMKWRTLDVALQQP 512 

Qy 239 HPFRAYLESEVAISEELVQKYS NSALGHVNSTIKELR 275 

I I : I :: I I : I I : I I I I 
Db 513 AVLQDE TQSLTQRLSFSPNQEI PFVPPMISVLRGIEPEWYAGYDNTKPETPS 565 

Qy 276 RLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTL 308

I :|| : :::: I ::| 
Db 566 SLLTSLNHLCERQLLCWKWSKLLPGFRNLHIDDQITLIQYS W MSL 611 



Qy 309 LILAL- ISLFSIPVIYERHQVQIDHY 333



' ' ' - I - - • I • I I [ : ; 

Db 612 MVFAMGWRSYKHVSGQMLYFAPDLILNEQRMKESSFYSLCLSMWQLPQEFVRLQVSQEEF 671 

Qy 334 L GLANKSVKDAM AKIQAKI PGLKRK 358 

I II ::| I I : I ||::| 

Db 672 LCMKALLLLNTI PLEGLRSQSQFDEMRTSYI RELVKAI GLRQK 714 
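
The "Best Local Similarity" figure printed with each alignment appears to be the count of identical positions divided by the total number of alignment columns (Matches + Conservative + Mismatches + Indels). That reading is an assumption inferred from the printed numbers, not a documented GenCore formula, and the function name below is hypothetical. A minimal Python sketch under that assumption:

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent of identical columns over all alignment columns.

    Assumed definition, inferred from the report's printed statistics.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts copied from the statistics line of the alignment above
# (Matches 121; Conservative 54; Mismatches 119; Indels 469).
pct = best_local_similarity(matches=121, conservative=54,
                            mismatches=119, indels=469)
print(f"Best Local Similarity: {pct:.1f}%")
```

With these counts the sketch gives 15.9%, which agrees with the value reported for this result.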



RESULT 8 
T31421 

C-terminal domain-binding protein rA1 - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 07-Dec-1999 
C;Accession: T31421 

R;Yuryev, A.; Patturajan, M.; Litingtung, Y.; Joshi, R.V.; Gentile, C.; Gebara, 
M.; Corden, J.L. 

Proc. Natl. Acad. Sci. U.S.A. 93, 6975-6980, 1996 

A;Title: The C-terminal domain of the largest subunit of RNA polymerase II 
interacts with a novel set of serine/arginine-rich proteins. 
A;Reference number: Z21024; MUID:96293459; PMID:8692929 
A;Accession: T31421 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-1173 <YUR> 

A;Cross-references: EMBL:U49056; NID:g1438531; PID:g1438532; PIDN:AAC52657.1 
A;Experimental source: hippocampus 

Query Match 14.6%; Score 266.9; DB 2; Length 1173; 

Best Local Similarity 9.8%; Pred. No. 1.3; 

Matches 116; Conservative 32; Mismatches 69; Indels 961; Gaps 26; 

Qy 11 SSSTDSPPRPPP AF KYQ 27 

I II : I II I II | : || 

Db 119 SSSSPSPPPPPPPPPPPALPAPRFDI YDPFHPTDEAYSPPPAPEQKYDPFEATGSNPSSS 178 

Qy 28 -FVTEPEDEEDEEEEEDEE 45 

I I : I I : I I I I I : I I 

Db 179 GGTPSPEEEEEEEEEEEEEGLSQSIRRISETLAGIYDDNSLSQDFPGDDSPHREPPPPQT 238 

Qy 46 EDD 48 

I 

Db 239 LGAPGTPPQADSTRAEGAPRRRVFWGPEAEACLEGKVSVEWTTAGGPALPLPPLPPTD 298 

Qy 49 EDLEELEVLE RKP AAGLSAAAVPPAAA -- AP LL 79 

: : I I I : : : I : I I :: I I I I I : I I I I 

Db 299 PEIEEGEIVQPEEEPRVAVSLFRAARPRQPPASVATLASVAAPAAPPASAPRAPEGDDFL 358 

Qy 80 DFSSDS VPPAP RGPLPAAPPA-- 100 

: I I I I II I I I I I 

Db 359 S LHADSDGEGALQVDLGEP PAP PAADARWGGLDLRRKI LTQRRERYRQRSAS PGPP PARK 418 

Qy 101 APERQPSWE 109 

II I : I I 

Db 419 KARRERQRSGDPAPPDSPTWEAKKHRSRERKLGSHSTARRRSRSRSRRRSRSRSADRRRG 478 



Qy 110 114 



Db 479 SHRSRSREKRRRRRRSAS PPPAASS SS S SRRERHRGKRREGGKKKKKRSRSRAEKRSGDL 538 

Qy 115 PAPSLP PAAAV 125 

Mil I | I I 

Db 539 EKL PAP VPPSGSDRDSRRRGAVPPSIQDLTDHDLFAIKRTITVGRPDKTE PRAPS PAP AV 598 

Qy 126 LPS 128 

I 

Db 599 SPKGEVLYDSEGLSADERGAKGDKDRRRSGAASSSSSSREKASRRKALDGDRGRDRDRSS 658 

Qy 129 KLPEDDEP 136 

: I : I I 

Db 659 KKPRTPKDSAPGSGALPKAPPRSGSSSSSSSCSSRKVKLQSKVAVLIREGVSSTTPAKDS 718 

Qy 137 136 

Db 719 SSSGLGSIGVKFSRDRESRSPFLKPDERSPAEGVKVAPGSTKPKKTKAKAKAGAKKAKGT 778 

Qy 137 P 137 

I 

Db 779 KGKTKPSKTRKKVRSGGSSTASGGPGSLKKSKADSCSQAASAKGTEETSWSGEERTTKAP 838 

Qy 138 ARPP PPPP 145 

: I I I I I I 

Db 839 STPPPKVAPPPPALTPDSQTVDSSCKTPDVSFLAEEASEDTGVRVGAEEEEEEEEEEEEE 898 

Qy 146 AGASPLAEPAA 156 

I I : I I : I 

Db 899 EEQQPATTTATSTAAAAPSTAPSAGSTAGDSGAEDGPJ^ARASQLPTLPPPMPWNLPAGVD 958 

Qy 157 PPST 160 

I I I I 

Db 959 CTTSGVLALTALLFKMEEANLASRAKAQELIQDTNQILRHRKPPSTLGVTPAPVPTSFGL 1018 

Qy 161 PAAPKRRGSGSSVVDLLYWRDIKKTGVVFGA 191 

II: II II I i 

Db 1019 PPAPSSYLLPGSLPIGGCGSTPPTPTGLVPASDKREGSSSS 1059 

Qy 192 SLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAI 251 

I I 

Db 1060 EGR 1062 

Qy 252 SEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLIL 311 

I I I : : I : I : : I 

Db 1063 GDTDKYLKKLHTQERAVEEVK 1083 

Qy 312 ALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI 352 

II I : : : : I | | : | | | 

Db 1084 LSI KPYYQKKDITKEEY KDILRKAVHKICHSKSGEINPVKVSNLVRA 1130 

Qy 353 — PGL 355 

I I I 

Db 1131 YVQRYRYFRKHGRKPGDPPGPPRPPKEPGPPDKGGPGL 1168 



RESULT 9 
T38236 



hypothetical protein SPAC23A1.17 - fission yeast (Schizosaccharomyces pombe) 
C;Species: Schizosaccharomyces pombe 

C;Date: 03-Dec-1999 #sequence_revision 03-Dec-1999 #text_change 03-Dec-1999 
C;Accession: T38236 

R;Murphy, L.; Harris, D.; Wood, V.; Barrell, B.G.; Rajandream, M.A. 
submitted to the EMBL Data Library, February 1998 
A;Reference number: Z21780 
A;Accession: T38236 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A;Residues: 1-1611 <MUR> 
A;Cross-references: EMBL:AL021813; PIDN:CAA16991.1; GSPDB:GN00066; 
SPDB:SPAC23A1.17 
A;Experimental source: strain 972h-; cosmid c23A1 
C;Genetics: 
A;Gene: SPDB:SPAC23A1.17 
A;Map position: 1 

Query Match 14.6%; Score 265.5; DB 2; Length 1611; 

Best Local Similarity 10.1%; Pred. No. 3.1; 

Matches 131; Conservative 59; Mismatches 106; Indels 1005; Gaps 30; 

Qy 2 EDIDQSSLVS SSTDSPPRP 20 

I:: :| |: :|: I :| 

Db 299 EELSKSQRVAKDDDPEWSNTANSDEPASSSKPAKPLTDLNRAFSQRLNLDPQKPGKSQG 358 

Qy 21 PPAFKYQFVTEPED EEDEEEEEDEEE 46 

I : : : I I I I I I I : I : i : I : 

Db 359 EISEQEEDEYDDAESDEMHSPYSTHEPESEPEDQDEPSEKDDENKDVEEEQEQEQEEEQI 418 

Qy 47 DDEDLEELEVLER 59 

I I : : : : I I 

Db 419 DPEEAKRIALRERMAKMSGGIGMHVFGLPGLAAPIGRKNTLRRT PAKSSEEAKSTTNDSS 478 

Qy 60 KPAAGLS 66 

I I I 

Db 479 PPKDSSSTSTQPTEQSNAQQAPSPKEEERPLPSEPSQNQPAEYRDTPDTPRNIMPLPGLM 538 

Qy 67 AA 68 

: I 

Db 539 SADQPIKVTEPSNDADKAIVAEGPNNEEETKGPVIPETQETSEQQVHKTPSPEKQKVXSP 598 

Qy 69 AVPPAAAAPLL 79 

III : I I : 

Db 599 PPIITNFDKETLASNEAHEAVPQKPSAPQVTRLMAPQDSSSVWPSPTSLLDP7VRAVRKV 658 

Qy 80 DFSS DS 85 

I I : I II 
Db 659 IDGIDPPKEAGAGATADVESAANSPITPPRTWHSPDFTSKSFEPIERKLPSRISEVTEDS 718 

Qy 86 V PPAP 90 

: III 
Db 719 IDEDKQNEVDPSTSARALPPPGLRFGKVDTLASLAHDDLDDLPAVPRIFSPPPLPKTPSG 778 

Qy 91 RG P 93 

I I I 
Db 779 EFGDNEFMFPKKSNRVRGHQSRPSTGSQLRNWPVSIVTSGGRPALPDEMASPSSSIGHP 838 



Qy 94 LPAAPPA APERQPSWERS 111 

I I : I I I I I I I I I : I 

Db 839 LPSPPPADFNSLNVDFYEPHSYLESPAPEPQPSYEEESFNATVIHAPTPSTATFQGHPTI 898 

Qy 112 111 

Db 899 SNVATPPLKQDVTESKASPVADASATHQSSTGLTQEITQLGSNMRLPTKLTRPSNDGRKA 958 

Qy 112 PAAPAPSLPPAAAV L 126 

111111:111 I 
Db 959 SGPRPAAP-PSIPPPLPVSNILSSPTSEPPKDHPPSAPLSKPVSTSPAAPLARVPPVPKL 1017 

Qy 127 PSK LPEDDEPP 137 

II II III 

Db 1018 SSKAPPVPLPSADAPPIPVPSTAPPVPIPTSTPPVPKSSSGAPSAPPPVPAPSSEIPSIP 1077 

Qy 138 137 

Db 1078 APSGAPPVPAPSGIPPVPKPSVAAPPVPKPSVAVPPVPAPSGAPPVPKPSVAAPPVPVPS 1137 

Qy 138 ARPPPPPPAGASPLAEP -- AAPP 158 

I I I I I : I I I :: I I I I I 
Db 1138 GAPPVPKPSVAAPPVPAPSGAPPVPKPSVAAPPVPAPSSGIPPVPKPAAGVPPVPPPSEA 1197 

Qy 159 STP 161 

I I I 

Db 1198 PPVPKPSVGVPPVPPPSTAPPVPTPSAGLPPVPVPTAKAPPVPAPSSEAPSVSTPRSSVP 1257 

Qy 162 AAPKR RGS 169 

I I I I 

Db 1258 SPHSNASPSPTSSSMASAAPARTSVSRSKSKAERHETSTSSRKSSKSGEHHHHHNEGHAD 1317 

Qy 170 GSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVF SIVSVTAYI 211 

II I : I : I II II: I I I I I 

Db 1318 SSSTRTSLAHQDSRK SLHRHLSRSSSRASKKPSIVSTTGPFNESFSAKPVEPC 1370 

Qy 212 ALALLSVT I S FRI YKGV 228 

:: :: :: : | :|: 

Db 1371 AS EKWWLNSTAVPKS WQMNDSVLYMI KEGI TGQDKKYKSVHI LFPDYSQTVLTAT FNPH 1430 

Qy 229 IQAIQKSDEGHPFRAYLESEVAISEELVQK YSNSALGHVNSTIKELRRLF 278 

. : : I : I : I I : I I : I I I : I : : 
Db 1431 NQNITQLSQLQLAP PAQPSKARLDEEYACYGSTILKKARAYQGSMVGDGSA 1481 

Qy 279 LVD D LVD S L K FAVLMWVFT YVGAL FNGLTLLILALISLFSI PVI Y 323 

I I : I : : II II I 

Db 1482 FTFVNSVMSILAHNLEPINKQTFGG VIY 1509 

Qy 324 ERHQVQI DHYLGLANKSVK DAMAKIQAKI PGLK 356 

: : I : I : I : : I I I I 

Db 1510 K NVGNVTVQQIGEIRPGDIVTFDKAKFSGQK 1540 



RESULT 10 
T13286 

cappuccino gene protein - fruit fly (Drosophila melanogaster) 



C; Species: Drosophila melanogaster 

C;Date: 13-Aug-1999 #sequence_revision 13-Aug-1999 #text_change 17-Nov-2000 
C;Accession: T13286 

R;Emmons, S.; Phan, H.; Calley, J.; Chen, W.; James, B.; Manseau, L. 
Genes Dev. 9, 2482-2494, 1995 

A;Title: Cappuccino, a Drosophila maternal effect gene required for polarity of 
the egg and embryo, is related to the vertebrate limb deformity locus. 
A;Reference number: Z17651; MUID:96033799; PMID:7590229 
A;Accession: T13286 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-1058 <EMM> 

A;Cross-references: EMBL:U34258; NID:g1061333; PID:g1061334; PIDN:AAC46925.1 
C;Genetics: 
A;Gene: capu 

A;Cross-references: FlyBase:FBgn0000256 

Query Match 14.2%; Score 258.5; DB 2; Length 1058; 

Best Local Similarity 16.2%; Pred. No. 1.9; 

Matches 114; Conservative 51; Mismatches 103; Indels 435; Gaps 27; 

Qy 4 IDQSSLVS 11 

Db 340 LESASLASLGAGGVAGSLATIATATTASSDNQKTLQQILKKRLLNCSTLAEVHAVVNELL 399 



Qy 12 SSTDSPPRPP PAFKYQF VTEPEDEEDEEEEEDEE 45 

II I III I II" I I I : I I I I 

Db 400 SSVDEPPRRPSKRCVNLTELLNASEATVYEYNKTGAEGCVKSFTDAETQTESEDCEGTCK 459 

Qy 46 EDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAP 90 

Ml III IN Ml: Ml 

Db 460 CGQSSTKVSDNKSAKEDGE KPHA VAPPPPPPPPPLPAFVAPPPPPPP 506 

Qy 91 RGPLP AAPPAAPERQPSWERSPAAPAPS LPPAAAVLPSK 129 

II III I I :| I I: II : Ml 

Db 507 PPPPPPLANYGAPPPPPPPPPGSGSAPPPPPPAPIEGGGGIPPPPPPMSASPSKTTISPA 566 

Qy 130 -LPEDDE PP ARPPPPPP AG 147 

||: I M I I I M :l 

Db 567 PLPDPAEGNWFHRTNTMRKSAVNPPKPMRPLYWTRIVTSAPPAPRPPSVANSTDSTENSG 626 



Qy 148 ASPLAEPA APPSTPAAPKRRGSGSSVVDLLYWRDIKKT 185 

Mill 111:111 I M :M 

Db 627 SSPDEPPAANGADAPPTAPPATKE IWTEIEETPLDNIDEFTELFSRQAI 675 

Qy 186 GVVFGASLFLLLSLTVFSIVSVTAYIALALLS 217 

I:: : M : I 

Db 676 APVSKPKELKVKRAKSIKVLDPERSRNVGII WRSLHVPS 714 

Qy 218 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 277 

| || : : : : | : I I M I I : : I I I : I : 

Db 715 SEIEHAIY HIDTSV-VSLEALQHMSN IQATEDELQRI 750 

Qy 278 FLVD DL 283 

MM ' „ 

Db 751 KEAAGGDI PLDHPEQFLLDI SLI SMASERI SCIVFQAEFEESVTLLFRKLETVSQLSQQL 810 



Qy 284 VDS L K FAVLMWVFT YVGAL FN G LTLL -- 309 

: : t I I : : : : : I : II III 

Db 811 IESEDLKLVFSIILTLGNYM NGGNRQRGQADGFNLDILGKLKDVKSKESHTTLLHF 866 

Qy 310 309 

Db 867 IVRTYIAQRRKEGVHPLEIRLPIPEPADVERAAQMDFEEVQQQIFDLNKKFLGCKRTTAK 926 

Qy 310 ILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI 352 

: I I I I I I :::::: I : I I III: : 

Db 927 VLAA SRPEIMEPFKSKMEEFVEGADKS MAKLHQSL 961 



RESULT 11 
S24407 

formin isoform IV - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 19-Feb-1994 #sequence_revision 10-Nov-1995 #text_change 05-Nov-1999 
C;Accession: S24407 

R;Jackson-Grusby, L.; Kuo, A.; Leder, P. 
Genes Dev. 6, 23-31, 1992 

A;Title: A variant limb deformity transcript expressed in the embryonic mouse 
limb defines a novel formin. 

A;Reference number: S24407; MUID:92112033; PMID:1339380 
A;Accession: S24407 
A;Molecule type: mRNA 
A;Residues: 1-1206 <JAC> 

A;Cross-references: EMBL:X62379; NID:g51552; PIDN:CAA44244.1; PID:g51553 

Query Match 14.1%; Score 257.5; DB 2; Length 1206; 

Best Local Similarity 14.1%; Pred. No. 2.8; 

Matches 95; Conservative 39; Mismatches 95; Indels 445; Gaps 22; 

Qy 13 STDSPPRPPP AFKYQFV 29 

III I Ml I : 

Db 373 STDQESHKSPRDAHVQGGQVKARTPETALEAFKALFIRPPKKGSTADTSELEALKRKMKH 432 

Qy 30 TEPEDEEDEEEEEDE EEDD 48 

: I I : : I : I : I I 
Db 433 EKESLRAVFERSKSRPADSPSDPKSPDQSPTEQDDRTPGRLQAWPPPKTKDTEEKVGLK 492 

Qy 49 EDLEELEVLER 59 

I I I : I I : 

Db 493 YTEAEYQAAILHLKREHKEEIETLQAQFELKTFHIRGEHALVTARLEEAIENLKQQLEKR 552 

Qy 60 KPAAGLSAAA 69 

II II 

Db 553 REGCEEMRDVCISTDDDCSPKAFRNVCIQTDRETFLKPCDAESKATRSSQIVPKKLTISL 612 

Qy 70 VPPAAAAPLLDFSSDSVPPAP 90 

II II : I I I 

Db 613 TQLSPSKDSKDIHAP FQTREGTSSSSQQKISPPAPPTPPPLPPPL IPPPPPLP 665 

Qy 91 --RGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPP 145 

I I I I I I I I I : I I II I : : I III MINI 

Db 666 PGLGPLPPAPPIPPV CPVSPPPPPPPPP PTPVPPSDGPPPPPPPPPPLPN 715 



Qy 146 AGASP LAEPAAP PSTPAAPKRRG 168 

Db 716 VLALPNSGGPPPPPPPPPPPPGLAPPPPPGLSFGLSSSSSQYPRKPAIEPSCPMKP 771 

Qy 169 SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 228 

I I I 

Db 772 LYWTR 776 

Qy 229 IQAIQKSDEGHP FRAYLESEVAI SEELVQ 257 

II I I : I I II I :: I 

Db 777 IQINDKSQDAAPTLWDSLEEPHIRDTSEFEYLFSKDTTQQKKKPLSEAYEKKNKVKKIIK 836 

Qy 258 KYSNSA LGHVNSTIKELRR-LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLL 309 

II: : : : : I : : : : : I I I I 
Db 837 LLDGKRSQTVGILISSLHLEMKDIQQAI FTVDD 869 

Qy 310 ILALI SLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 349 

: : : I : : : I I I : : : I : | | : 

Db 870 — SWDLETLAALYE NRAQEDELTKIRKYYETSKEEDLKLLDKPEQF 914 

Qy 350 AKIPGLKRKA 359 

I : I I : I 

Db 915 LHELAQIPNFAERA 928 



RESULT 12 
A59266 

unconventional myosin-15 - human 
C; Species: Homo sapiens (man) 

C;Date: 02-Jun-2000 #sequence_revision 02-Jun-2000 #text_change 08-Sep-2000 
C;Accession: A59266 

R;Liang, Y.; Wang, A.; Belyantseva, I.A.; Anderson, D.W.; Probst, F.J.; Barber, 
T.D.; Miller, W.; Touchman, J.W.; Jin, L.; Sullivan, S.L.; Sellers, J.R.; 
Camper, S.A.; Lloyd, R.V.; Kachar, B.; Friedman, T.B.; Fridell, R.A. 
Genomics 61, 243-258, 1999 

A;Title: Characterization of the human and mouse unconventional myosin XV genes 
responsible for hereditary deafness DFNB3 and shaker 2. 
A;Reference number: A59266; MUID:20021762; PMID:10552926 
A;Accession: A59266 

A;Status: preliminary; not compared with conceptual translation 
A;Molecule type: mRNA 
A;Residues: 1-3530 <LIA> 

A;Cross-references: GB:AF144094; NID:g6224682; PIDN:AAF05903.1; PID:g6224683 

C;Superfamily : myosin motor domain homology 

F; 1225-1887/Domain: myosin motor domain homology <MMO> 

Query Match 14.1%; Score 256.4; DB 2; Length 3530; 

Best Local Similarity 10.1%; Pred. No. 45; 

Matches 145; Conservative 54; Mismatches 124; Indels 1106; Gaps 37; 

Qy 17 PPRPP PAFKYQFV TEPEDEEDEEE 40 

Mt 111:1 Mill: 

Db 394 PPEVPYFYPEESASAFVYPWVPPPIPSPHNPYAHAMDDIAELEEPEDAGVERQGTSFRLP 453 

Qy 41 EEDEE 45 

: I I I 

Db 454 SAAFFEQQGMDKPARSKLSLIRKFRLFPRPQVKLFGKEKLEVPLPPSLDIPLPLGDADEE 513 



Qy 46 EDDEDLEELEVL 57 

I I : I : I : : 

Db 514 EDEEELPPVSAVPYGHPFWGFLTPRQRNLQRALSAFGAHRGLGFGPEFGRPVPRPATSIA 573 

Qy 58 ERKPAAGL 65 

I : I I I I 

Db 574 RFLKKTLSEKKPIARLRGSQKTRAGGPAVREAAYKRFGYKLAGMDPEKPGTPIVLRRAQP 633 
Qy 66 65 

Db 634 RARSSNDARRPPAPQPAPRTLSHWSALLSPPVPPRPPSSGPPPAPPLSPALSGLPRPASP 693 

Qy 66 S AAAVP PAAAA 76 

: I I I I I I 

Db 694 YG S L RRH P P P WAAP AH VP PAP Q AS GWAFVE P P AVS P EVP P D L LAF P G P R P S FRG S RRRGA 753 

Qy 77 PLLDFSSDSVPPAPR GPL — PAAPP 99 

I I : I I I : I : I I II 

Db 754 AFGFPGASPRASRRRAWSPLASPQPSLRSSPGLGYCSPLAPPSPQLSLRTGPFQPPFLPP 813 

Qy 100 A APERQ PSWERSPAAPAPSLPP AAAVLPSKL 130 

I Ml I I :| I II :: I ||:| 

Db 814 ARRPRSLQESPAPRRAAGRLGPPGSPLPGSPRPPSPPLGLCHSPRRSSLNLPSRLPHTWR 873 

Qy 131 PEDDE 135 

Db 874 RLSEPPTRAVKPQVRLPFHRPPRAGAWRAPLEHRESPREPEDSETPWTVPPLAPSWDVDM 933 

Qy 136 -PPARPPPPPPAGA SPLAEPAAP 157 

I III I I II II :| I 

Db 934 PPTQRPPSPWPGGAGSRRGFSRPPPVPENPFLQLLGPVPSPTLQPEDPAADMTRVFLGRH 993 

Qy 158 PSTPAAPK 165 

I III II 

Db 994 HEPGPGQLTKSAGPTPEKPEEEATLGDPQLPAETKPPTPAPPKDVTPPKDITPPKDVLPE 1053 

Qy 166 RR 167 

I I 

Db 1054 QKTLRPSLSYPLAACDQTRATWPPWHRWGTLPQAAAPLAPIRAPEPLPKGGERRQAAPGR 1113 
Qy 168 167 

Db 1114 FAWMPRVQKLSSFQRVGPATLKPQVQPIQDPKPRACSLRWSCLWLRADAYGPWPRVHTH 1173 

Qy 168 GSGSS WDLLYWRDI 182 

I I- I I: h 

Db 1174 PQSCHLGPGAACLSLRGSWEEVGPPSWRNKMHSIRNLPSMRFREQHGEDGVEDMTQLEDL 1233 

Qy 183 KKTGWFGASLFLLLSLTVFSIVSVT AYIALALLSVT--ISFRI Y 225 

: = i III:: II I : I I I I I 

Db 1234 QET TVLSNLKI RFERNLI YT YI GS I LVSVNP YQMFGI YGPEQVQQY 1279 

Qy 226 K 226 

Db 1280 NGRALGENPPHLFAVANLAFAKMLDAKQNQCIIISGESGSGKTEATKLILRYLAAMNQKR 1339 



Qy 227 GVIQAIQ KSDEGHPFRAYLE S EVAI SEE LVQK 258 

|:| I: ::t I ::| I |: |::| 

Db 1340 EVMQQIKILEATPLLESFGNAKTVRNDNSSRFGKFVEIFLEGGVISGAITSQYLLEKSRI 1399 

Qy 259 YSNSA 263 

Db 1400 VFQAKNERNYHIFYELLAGLPAQLRQAFSLQEAETYYYLNQGGNCEIAGKSDADDFRRLL 1459 

Qy 264 LGH 266 

Db 1460 AAMEVLGFSSEDQDSIFRILASILHLGNVYFEKYETDAQEVASWSAREIQAVAELLQIS 1519 

Qy 267 VNSTIKELRRLF LVD — DLVDSLKFAVLM-WVFTYVGAL 302 

I I : : I : : I II I : : : I : I I : I I I I 

Db 1520 PEGLQKAITFKVTETMRE--KIFTPLTVESAVDARDAIAKVLYALLFSWLITRVNALVSP 1577 

Qy 303 FNGLTLLIL ALISLFSIPVI YERHQVQ 329 

II I : I I I : I III: 

Db 1578 RQDTLSIAILDIYGFEDLSFNSFEQLCINYANENLQYLFNKIVFQEEQEEYIREQIDWQE 1637 

Qy 330 329 

Db 1638 ITFADNQPRINLISLKPYGILRILDDQCCFPQATDHTFLQKCHYHHGANPLYSKPKMPLP 1697 

Qy 330 IDHYLGLA 337 

I I I I 

Db 1698 EFTIKHYAGKVTYQVHKFLDKNHDQVRQDVLDLFVRSRTRWAHLFSSHAPQAAPQRLGK 1757 

Qy 338 NKSVKDAMAKIQ AKIPGL 355 

: I : I : I :: I I I I 

Db 1758 SSSVTRLYKAHTVAAKFQQSLLDLVEKMERCNPLFMRCLKPNHKKEPGL 1806 



RESULT 13 
S11515 

formin - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 22-Jan-1994 #sequence_revision 10-Nov-1995 #text_change 05-Nov-1999 
C;Accession: S11515 

R;Woychik, R.P.; Maas, R.L.; Zeller, R.; Vogt, T.F.; Leder, P. 
Nature 346, 850-853, 1990 

A;Title: 'Formins': proteins deduced from the alternative transcripts of the 
limb deformity gene. 

A;Reference number: S11515; MUID:90363291; PMID:2392150 
A;Accession: S11515 
A;Molecule type: mRNA 
A;Residues: 1-1468 <WOY> 

A;Cross-references: EMBL:X53599; NID:g52877; PIDN:CAA37668.1; PID:g52878 

Query Match 14.0%; Score 254.9; DB 2; Length 1468; 

Best Local Similarity 14.5%; Pred. No. 5.7; 

Matches 97; Conservative 40; Mismatches 103; Indels 431; Gaps 23; 

Qy 2 EDIDQSSLVSSSTDSPPRP— PPA FKYQ 27 

I : : I : I I : II III 

Db 602 EEASEKGLGPEKITAPPQHQLPPGIASEGFPCDNFKEQTAKDLPNKDGGVWVPGYRAGPP 661 



Qy 28 --FVTEPEDEEDEEEE EDEEEDD 48 

i : ||: I I : I I 

Db 662 CPFLLHEEKEKTSRSELYLDLNPDQSPTEQDDRTPGRLQAWPPPKTKDTEEKVGLKYTE 721 

Qy 49 EDLEELEVLER 59 

I I I : I I : 

Db 722 AEYQAAI LHLKREHKEEI ETLQAQFELKTFHI RGEHALVTARLEEAI ENLKQQLEKRREG 781 

Qy 60 KPAAGLSAAA 69 

II II 

Db 782 CEEMRDVCISTDDDCSPKAFRNVCIQTDRETFLKPCDAESKATRSSQIVPKKLTISLTQL 841 

Qy 70 VPPAAAAPLLDFSSDSVPPAP R 91 

II II : I I I 

Db 842 SPSKDSKDIHAPFQTREGTSSSSQQKISPPAPPTPPPLPPPL IPPPPPLPPGL 894 

Qy 92 GPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPP 145 

I I I I I II I I : I I II I : : I III I I II I I 

Db 895 GPLPPAPPIPPV CPVSPPPPPPPPP PTPVPPSDGPPPPPPPPPPLPNVLA 944 

Qy 146 AGASP LAEPAAP PSTPAAPKRRGSGS 171 

: I I I I I I I I I I 
Db 945 LPNSGGPPPPPPPPPPPPGLAPPPPPGLSFGLSSSSSQYPRKPAIEPSCPMKP 997 

Qy 172 SVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQA 231 

III II 
Db 998 LYWTR IQI 1005 

Qy 232 IQKSDEGHP FRAYL E S EVAI SEE LVQ 257 

I I : I I I I I : : I 

Db 1006 NDKSQDAAPTLWDSLEEPHIRDTSEFEYLFSKDTTQQKKKPLSEAYEKKNKVKKIIKLLD 1065 

Qy 258 -KYSNSA LGHVNSTIKELRR-LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 312 

II: : : : : | : : : : : | | | | : 
Db 1066 GKRSQTVGI LI S S LHLEMKDI QQAI FTVDD S 1096 

Qy 313 LISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 349 

:: I ' I I |:: :| : ||: 

Db 1097 WDLETLAALYE NRAQEDELTKIRKYYETSKEEDLKLLDKPEQFLHE 1143 

Qy 350 -AKI PGLKRKA 359 

I : I I : I 

Db 1144 LAQIPNFAERA 1154 



RESULT 14 
T26998 

hypothetical protein Y48B6A.6 - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26998 
R;Wall, M. 

submitted to the EMBL Data Library, September 1999 
A;Reference number: Z20297 
A;Accession: T26998 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A;Residues: 1-716 <WIL> 

A;Cross-references: EMBL:AL110490; NID:e1542263; PIDN:CAB54442.1; CESP:Y48B6A.6 

A;Experimental source: clone Y48B6A 

C;Genetics: 

A;Gene: CESP:Y48B6A.6 

A;Introns: 38/3; 196/3; 437/2; 460/3; 518/1; 617/3; 673/3 

Query Match 14.0%; Score 254.7; DB 2; Length 716; 

Best Local Similarity 16.9%; Pred. No. 0.97; 

Matches 101; Conservative 39; Mismatches 106; Indels 353; Gaps 23; 

Qy 5 DQSSLVSSSTD SPPRPP PAFKYQFVTEPEDEEDEEEEEDEEEDD 48 

: I I I 1 II I I I I : I Ihll I I : 

Db 178 EMESLVQKQVDVLDQIMSSPPPPPTSPPQLPPTPPARITSVRSDESIEEEERRRKESEET 237 

Qy 49 EDLEELEV LERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLP 95 

I I I I : I I III I : I I I I I I I I 
Db 238 AS FEELEAEIMRI S RS P VPP PVL SIPPPPPPNIPLPTIPQEVQSP 282 



Qy 96 AAPPAAPERQPSWE RSPAAPAPSLPP 121 

: I I I II: : : | | | | | 

Db 283 PSPRPTSVPPPIPSPGPSEDVNMDELIESFSDSVIFNNSMSPPPPLPPLRESSLETLEVT 342 



Qy 122 AAAVLPSKLPE DDEPPA 138 

I : I I : I I I I 

Db 343 PEDPVTESKVEASPTPLPKATESLNESSIKALEGLEVKALEAQEASDDRPSAPTPIRDSS 402 



Qy 139 RPPPPPPAGASPLA EPAAP PSTPAAPKRR 167 

I I I I I I : I I I I : : I I : I I I I : 

Db 403 LPPPPPPKPETPLAIRRAGPIPTPQLLEMIHQEDCSIRPSSPTSVSHGSRPQSPAVPKKP 462 



Qy 168 GSGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 221 

I I I : I 

Db 463 SVTVSPLGLLC 473 



Qy 222 FRI YKGVI QAI - 232 

: | : : : 

Db 474 DPNLSIEKPEEMKTEDTKPVETAPAPVDEAELNDALDRRNKINEATCLTKIFPSLVSKYF 533 



Qy 233 QKSDEGHP FRAYLESE 248 

:: i I I I : I I : 

Db 534 QNCSFDFLNDKRSGENVPLKMNKKISLYAEFSEFSRKQIQYFSGIFKKYDEDQDSYIDFN 593 

Qy 249 VAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLM 293 

: I : : I I : : I I : : I I I : I I : 
Db 594 ELKRMMEKLGEAQTHIAL-KELIKKVDEDQDGKISQ REFFLI FRLAASGE 642 

Qy 294 WVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKI 352 

I : : I : I I : I I I I I I I I 

Db 643 LSCSEVFK — TLAESVDVSKEGVLGAAN FFQAKI 674 



RESULT 15 
F86387 

probable Pto kinase interactor [imported] - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 31-Mar-2001 



C; Accession: F86387 

R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.; 
Alonso, J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.; 
Chan, A.; Chao, Q.; Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.; 
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.; 
Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii, C.Y.; Gill, J.E.; Goldsmith, A.D.; 
Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L. 
Nature 408, 816-820, 2000 

A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.; 
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.; 
Langin-Hooper, S.; Lee, A.; Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu, 
S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R.; Marziali, A.; Militscher, J.; Miranda, 
M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.; Pai, G.; Peterson, J.; Pham, P.K.; 
Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H. 

A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.; 
Tallon, L.J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken, 
S.; Vaysberg, M.; Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.; 
Venter, J.C.; Davis, R.W. 

A;Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis. 
A;Reference number: A86141; MUID:21016719; PMID:11130712 
A;Accession: F86387 
A;Status: preliminary 
A;Molecule type: DNA 
A;Residues: 1-760 <STO> 
A;Cross-references: GB:AE005172; NID:g11079512; PIDN:AAG29223.1; GSPDB:GN00141 
C;Genetics: 
A;Map position: 1 

Query Match 13.9%; Score 253.8; DB 2; Length 760; 

Best Local Similarity 17.1%; Pred. No. 1.2; 

Matches 115; Conservative 42; Mismatches 132; Indels 382; Gaps 25; 

Qy 2 EDIDQS-SLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I : : I II I : | | | : I II I : 

Db 10 EEVSLSPSLASPPLMALPPPQPSFPGDNATSP TRE 44 

Qy 61 PAAGLSAAAV-PPAAAAPLLDFSSDSVP PAPRGPLP 95 

I I | | : : | : I I I I II 

Db 45 PTNGNPPETTNTPAQSSPPPETPLSSPPPEPSPPSPSLTGPPPTTIPVSPPPEPSPPPPL 104 

Qy 96 AAP PAAPERQPSWERS PAAP APSLP 120 

I I I I I I III I I I I I 

Db 105 PTEAPPPANPVSSPPPESSPPPPPPTEAPPTTPITSPSPPTNPPPPPESPPSLPAPDPPS 164 

Qy 121 PAAAVLPSKLPEDDEPPARPP 141 

II: II 1111:11 
Db 165 NPLPPPKLVPPSHSPPRHLPSPPASEIPPPPRHLPSPPASERPSTPPSDSEHPSPPPPGH 224 

Qy 142 P P P PAGAS P LAE PAAP PSTPAAPKR 166 

I I I I I I : I I I I : I : 

Db 225 PKRREQPPPPGSKRPTPSPPSPSDSKRPVHPSPPSPPEETLPPPKPSPDPLPSNSSSPPT 284 

Qy 167 RGSGSSWDLLYWRDIKKTGWFG 190 

III::: III 

Db 285 LLPPSSWSPPSPPRKSVSGPDNPSPNNPTPVTDNSSSSGISIAAW GVSIG 336 



Qy 191 ASLFLLLSLTVFSIV- 205 



Db 337 VALVL LTLIGVWCCLKKRKKRLSTIGGGYVMPTPMESSSPRSDSALLKTQSSAPLV 393 

Qy 206 SVTAYIALA LLSVTISF RIYKGV 228 

I I : : : I : I I I : I I I I 

Db 394 GNRSSNRTYLSQSEPGGFGQSRELFSYEELVIATNGFSDENLLGEGGFGRVYKGVLPDER 453 

Qy 229 IQAIQKSDEG HPFRAYLESEVAI SEELVQKYSNSALGHVNSTIKELRRLFLVD-- 281 

: | : : : I I : I : : : : : : : I I I I I I I : I 

Db 454 WAVKQLKI GGGQGDREFKAEVDT ISRVHHRNLLSMVGYCISENRRLLIYDYV 506 

Qy 282 DLVDSLKFAVLMWVFTYVGALFNGLTLL 309 

I : I I I I I I I 

Db 507 PNNNLYFHLHGTPGLDWATRVKIA AGAA-RGLAYLHEDCHPRI IHRDIKSSN 557 

Qy 310 IL ALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPG 354 

II I I : I I Mill I :: I 

Db 558 I LLENNFHALVS DF GLA-KLALDCNTHITTRVMGTFGYMAPEY 599 

Qy 355 LKRKAD 360 

I 1:1 

Db 600 ASSGKLTEKSD 610 



Search completed: September 29, 2004, 18:55:42 
Job time : 35.4632 secs 
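
The "Million cell updates/sec" figure in each report header is consistent with the query length multiplied by the number of database residues, divided by the search time. This is an inference from the printed values rather than a documented definition, and the function name is hypothetical. A short Python sketch using the figures of the SPTREMBL search header (360-residue query, 315518202 residues, 76.3939 seconds):

```python
def million_cell_updates_per_sec(query_len, db_residues, search_seconds):
    """Dynamic-programming cells filled per second, in millions.

    Assumes one cell per (query residue, database residue) pair.
    """
    cells = query_len * db_residues
    return cells / search_seconds / 1e6


# Values copied from the SPTREMBL search header of this report.
rate = million_cell_updates_per_sec(query_len=360,
                                    db_residues=315_518_202,
                                    search_seconds=76.3939)
print(f"{rate:.3f} Million cell updates/sec")
```

The result agrees with the 1486.854 printed in that header to within rounding.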



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 

OM protein - protein search, using sw model 

Run on:         September 29, 2004, 18:40:14 ; Search time 76.3939 Seconds 
                (without alignments) 
                1486.854 Million cell updates/sec 

Title:          SEQ2_1-171_975-1163 
Perfect score:  1823 
Sequence:       1 MEDIDQSSLVSSSTDSPPRP...VKDAMAKIQAKIPGLKRKAD 360 

Scoring table:  BLOSUM62 
                Gapop 10.0 , Gapext 0.1 

Searched:       1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 1017041 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
                 Maximum Match 100% 
                 Listing first 45 summaries 

Database : SPTREMBL_25:* 
            1: sp_archea:* 
            2: sp_bacteria:* 
            3: sp_fungi:* 
            4: sp_human:* 
            5: sp_invertebrate:* 
            6: sp_mammal:* 
            7: sp_mhc:* 
            8: sp_organelle:* 
            9: sp_phage:* 
           10: sp_plant:* 
           11: sp_rodent:* 
           12: sp_virus:* 
           13: sp_vertebrate:* 
           14: sp_unclassified:* 
           15: sp_rvirus:* 
           16: sp_bacteriap:* 
           17: sp_archeap:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

                     % 
Result             Query 
   No.    Score    Match  Length  DB  ID       Description 

     1   1721.6     94.4     356  11  Q8BH78   Q8bh78 mus musculu 
     2   1712.7     93.9     375  11  Q8BHF5   Q8bhf5 mus musculu 
     3   1711.5     93.9     357  11  Q8K3G7   Q8k3g7 mus musculu 
     4   1637       89.8    1162  11  Q8BGM9   Q8bgm9 mus musculu 
     5   1622.9     89.0    1163  11  Q8K3G8   Q8k3g8 mus musculu 
     6   1598.4     87.7     392   4  Q96B16   Q96b16 homo sapien 
     7   1012.1     55.5     986   4  Q8IUA4   Q8iua4 homo sapien 
     8   1010.2     55.4    1046  11  Q8BGK7   Q8bgk7 mus musculu 
     9    992.2     54.4     578  11  Q80W95   Q80w95 mus musculu 
    10    988.7     54.2     639  11  Q8K290   Q8k290 mus musculu 
    11    878       48.2     184   6  Q7YRW9   Q7yrw9 bos taurus 
    12    874.8     48.0     199  13  Q7T224   Q7t224 gallus gall 
    13    855.6     46.9     720  11  Q7TNB7   Q7tnb7 mus musculu 
    14    844       46.3     179   6  Q9GM33   Q9gm33 macaca fasc 
    15    826.2     45.3     780  11  Q8K0T0   Q8k0t0 mus musculu 
    16    823.2     45.2     780  11  Q8K4S4   Q8k4s4 mus musculu 
    17    793.9     43.5     760  13  Q90638   Q90638 gallus gall 
    18    736.6     40.4     214  13  Q7T222   Q7t222 carassius a 
    19    687.7     37.7     643  11  Q8CCU2   Q8ccu2 mus musculu 
    20    683       37.5     199   4  Q9BQ59   Q9bq59 homo sapien 
    21    675.4     37.0     595   5  Q9VMV9   Q9vmv9 drosophila 
    22    669       36.7     267  11  Q63765   Q63765 rattus sp. 
    23    665       36.5     208  13  Q90637   Q90637 gallus gall 
    24    659.9     36.2     237  11  Q8C6D5   Q8c6d5 mus musculu 
    25    657.4     36.1     236  11  Q8VBU0   Q8vbu0 rattus norv 
    26    588.6     32.3     221  13  Q7ZUD6   Q7zud6 brachydanio 
    27    527.7     28.9     222   5  Q9VMW4   Q9vmw4 drosophila 
    28    523.9     28.7     234   5  Q9VMW3   Q9vmw3 drosophila 
    29    521       28.6     224   5  Q9VMW1   Q9vmw1 drosophila 
    30    518       28.4     202   5  Q9VMW2   Q9vmw2 drosophila 
    31    437.7     24.0    2484   5  Q9U347   Q9u347 caenorhabdi 
    32    427.4     23.4    2607   5  Q23187   Q23187 caenorhabdi 
    33    344       18.9     107  13  Q7T223   Q7t223 carassius a 
    34    337.5     18.5     222   5  Q23188   Q23188 caenorhabdi 
    35    286.9     15.7    1312   4  Q7Z5V7   Q7z5v7 homo sapien 
    36    286.9     15.7    1343   4  Q9H7N4   Q9h7n4 homo sapien 
    37    278.9     15.3    1312   4  Q9NR59   Q9nr59 homo sapien 
    38    272.4     14.9    1217   4  Q9ULL5   Q9ull5 homo sapien 
    39    266.9     14.6    1173  11  Q63624   Q63624 rattus norv 
    40    265.5     14.6    1611   3  O42854   O42854 schizosacch 
    41    263.6     14.5    1179  12  Q91L98   Q91l98 white spot 
    42    263.5     14.5    1180  12  Q8VAS9   Q8vas9 white spot 
    43    263.4     14.4    1709   4  O15047   O15047 homo sapien 
    44    263       14.4    1185  12  Q8QTC5   Q8qtc5 white spot 
    45    263       14.4    1627  10  Q84ZL0   Q84zl0 oryza sativ 

ALIGNMENTS 



RESULT 1 
Q8BH78 

ID   Q8BH78 PRELIMINARY; PRT; 356 AA. 
AC   Q8BH78; 
DT   01-MAR-2003 (TrEMBLrel. 23, Created) 
DT   01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 
DT   01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE   RTN4. 
GN   RTN4. 
OS   Mus musculus (Mouse). 
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC   Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
OX   NCBI_TaxID=10090; 
RN   [1] 
RP   SEQUENCE FROM N.A. 
RC   STRAIN=129/SvcJ7, and 129SvcJ7; 
RA   Oertle T., van der Putten H., Schwab M.E.; 
RT   "Genomic Structure and Functional Characterization of the Promoter 
RT   Structures of Human and Mouse Nogo/Rtn-4."; 
RL   Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 
RN   [2] 
RP   SEQUENCE FROM N.A. 
RC   STRAIN=129/SvcJ7, and 129SvcJ7; 
RA   Oertle T., Schwab M.E.; 
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN   [3] 
RP   SEQUENCE FROM N.A. 
RC   STRAIN=129/SvcJ7; 
RA   Van der Putten H.; 
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN   [4] 
RP   SEQUENCE FROM N.A. 
RC   STRAIN=129SvcJ7; 
RA   Van der Putten H., Mir A.; 
RL   Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
DR   EMBL; AY102281; AAM73503.1; 
DR   EMBL; AY102286; AAM73508.1; 
DR   MGD; MGI:1915835; Rtn4. 
DR   GO; GO:0005783; C:endoplasmic reticulum; IDA. 
DR   GO; GO:0007399; P:neurogenesis; IDA. 
DR   InterPro; IPR003388; Reticulon. 
DR   Pfam; PF02453; Reticulon; 1. 
DR   PROSITE; PS50845; RETICULON; 1. 
SQ   SEQUENCE 356 AA; 38403 MW; 4366C03BA9630B56 CRC64; 



Query Match 94.4%; Score 1721.6; DB 11; Length 356; 

Best Local Similarity 96.4%; Pred. No. 1.3e-63; 

Matches 347; Conservative 2; Mismatches 7; Indels 4; Gaps 3 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I : I I I I I M I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

.lllhll Ml I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I 
Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVP PAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSVVDLLYWR 180 

I I I I I II I I II I I I I I I I I II II t I I I I I I II I I M I I I I I I I I I I I I I I M I I I I 
Db 119 PAAAVLPSKLPEDDEPPAR — PPAPAGASPLAEPAAPPSTPAAPKRRGSGS WVDLLYWR 176 

Qy 181 DIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHP 240 

I I I I II I I II I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 177 DIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHP 236 



Qy 241 FRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVG 300 

II I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 237 FRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVG 296 

Qy 301 ALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360

I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : 
Db 297 ALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAE 356



RESULT 2
Q8BHF5

ID Q8BHF5 PRELIMINARY; PRT; 375 AA.
AC Q8BHF5;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE RTN4.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7, and 129SvcJ7;
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7, and 129SvcJ7;
RA Oertle T., Schwab M.E.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7;
RA Van der Putten H.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RC STRAIN=129SvcJ7;
RA Van der Putten H., Mir A.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AY102282; AAM73504.1; -.
DR EMBL; AY102286; AAM73509.1;
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 375 AA; 40300 MW; 23D9EB19BE671AE6 CRC64;



Query Match 93.9%; Score 1712.7; DB 11; Length 375; 

Best Local Similarity 91.6%; Pred. No. 3.5e-63; 



Matches 347; Conservative 3; Mismatches 6; Indels 23; Gaps 4 



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I II I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

I I M I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I t II I I I II 
Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSG---------- 170

I I I I || M I I I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176

Qy 171 ---------SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 221

I : I I I I I I M I II I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I II I I I I I I I I 
Db 177 AASEPVIPSSAVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 236

Qy 222 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVD 281 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 237 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVD 296 

Qy 282 DLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSV 341 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I II I 
Db 297 DLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSV 356

Qy 342 KDAMAKIQAKIPGLKRKAD 360

I I M I I I I I I I I I I I I I I : 
Db 357 KDAMAKIQAKIPGLKRKAE 375



RESULT 3 
Q8K3G7 

ID Q8K3G7 PRELIMINARY; PRT; 357 AA.
AC Q8K3G7;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Nogo-B.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BALB/c;
RA Jin W., Li R., Long M., Shen J., Ju G.;
RT "Cloning and expression of the mouse Nogo-B protein.";
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AY114153; AAM77069.1; -.
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64;



Query Match 93.9%; Score 1711.5; DB 11; Length 357; 

Best Local Similarity 96.1%; Pred. No. 3.5e-63; 

Matches 347; Conservative 2; Mismatches 7; Indels 5; Gaps 4 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I : I I I I I I I I I I I I II I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

1 I I I I I I I III I I I I I I I I I I I I I I! I I I I I I I I I I 1 I I I II I I I I I I I I I I I I II 
Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSVVDLLYWR 180

I I I I I I I I I I I I I II I I I I II I I I II I I I I I I I I I I I II I I I II I I I I I I I I I I I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVVVDLLYWR 176

Qy 181 DIKKTGVV-FGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGH 239

I I I M I I I II I I I I I I I I II I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M 
Db 177 DIKKTGVVYFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGH 236

Qy 240 PFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYV 299 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 237 PFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYV 296 

Qy 300 GALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKA 359

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 297 GALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQAKIPGLKRKA 356

Qy 360 D 360 

Db 357 E 357 



RESULT 4 
Q8BGM9 

ID Q8BGM9 PRELIMINARY; PRT; 1162 AA.
AC Q8BGM9;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE RTN4.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7, and 129SvcJ7;
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7, and 129SvcJ7;
RA Oertle T., Schwab M.E.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7;
RA Van der Putten H.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RC STRAIN=129SvcJ7;
RA Van der Putten H., Mir A.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AY102284; AAM73506.1; -.
DR EMBL; AY102286; AAM73511.1; -.
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 1162 AA; 126613 MW; 855697FBEE11781F CRC64;



Query Match 89.8%; Score 1637; DB 11; Length 1162;

Best Local Similarity 29.8%; Pred. No. 9.5e-59; 

Matches 348; Conservative 2; Mismatches 6; Indels 810; Gaps 4; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I II I I I I I 1 I I I I I I I I I I I I I I I I I I I I I M I I I : I I I I I I I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

I I I I I I I I Ml I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I II I I I I I I I I I I I 
Db 60 PAAGLSAAPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGS--------- 171

I I I 11 I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I 1 I I I M I I I I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176



Qy 172 171 

Db 177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236 

Qy 172 — 171 

Db 237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296 

Qy 172 171 

Db 297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMK 356

Qy 172 171 

Db 357 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGK 416

Qy 172 171 



Db 417 DSESRNENASFPRTPELVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKI 476

Qy 172 171 

Db 477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAVVATMPEGLTPDLVQEA 536

Qy 172 171 

Db 537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596 

Qy 172 171 

Db 597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIK 656 

Qy 172 171 

Db 657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDS 716 

Qy 172 171 

Db 717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776

Qy 172 171 

Db 777 LESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFS 836 

Qy 172 171 

Db 837 DSSPIEIIDEFPTFVSAKDDSPKEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSFKNT 896 

Qy 172 171 

Db 897 YPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPSDTE 956 

Qy 172 -----------------SVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALA 214



Db 957 KEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALA 1016

Qy 215 LLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKEL 274 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 
Db 1017 LLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKEL 1076 

Qy 275 RRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYL 334 

I I I I 1 I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I II I I I M I I I I I I I I I I I 
Db 1077 RRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYL 1136

Qy 335 GLANKSVKDAMAKIQAKIPGLKRKAD 360

I I I I I I I I I I I I I I I I I I I I I I I I I : 
Db 1137 GLANKSVKDAMAKIQAKIPGLKRKAE 1162

RESULT 5 
Q8K3G8 

ID Q8K3G8 PRELIMINARY; PRT; 1163 AA.
AC Q8K3G8;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Nogo-A.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BALB/c;
RA Jin W., Long M., Li R., Ju G.;
RT "Cloning and expression of the mouse Nogo-A protein.";
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AY114152; AAM77068.1;
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 1163 AA; 126691 MW; 6B5F362799417EA4 CRC64;

Query Match 89.0%; Score 1622.9; DB 11; Length 1163; 

Best Local Similarity 29.7%; Pred. No. 3.7e-58; 

Matches 347; Conservative 2; Mismatches 7; Indels 811; Gaps 5; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I : I I I I I I I I I I I I I II I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120 

I I I I I II III I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 60 PAAGLSAVPVPP-AAAPLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAASAPSLP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGS--------- 171

I I M I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I II I I I II I I 
Db 119 PAAAVLPSKLPEDDEPPAR--PPAPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 176



Qy 172 171 

Db 177 AASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYL 236 

Qy 172 171 

Db 237 GNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGES 296 

Qy 172 171 

Db 297 AMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMK 356

Qy 172 171 

Db 357 MSVVAPVREEYADFKPFEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKSHGK 416

Qy 172 171 

Db 417 DSESRNENASFPSTPELVKDGSRAYITCDSFTSATESTAANIFPVLEDHTSENKTDEKKI 476 



Qy 172 — 171 

Db 477 EERKAQIITEKTSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAVVATMPEGLTPDLVQEA 536

Qy 172 171 

Db 537 CESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEA 596 

Qy 172 171 

Db 597 PLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDAKEEIK 656 

Qy 172 171 

Db 657 EPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEKSVPDHCELVDDS 716 

Qy 172 171 

Db 717 SPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPY 776 

Qy 172 171 

Db 777 LESFQPNLHITKDAASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFS 836

Qy 172 171 

Db 837 DSSPIEIIDEFPTFVSAKDDSPKEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSFKNT 896 

Qy 172 171 

Db 897 YPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPSDTE 956 

Qy 172 -----------------SVVDLLYWRDIKKTGVV-FGASLFLLLSLTVFSIVSVTAYIAL 213



Db 957 KEDRSLTAVLSAELNKTSVVDLLYWRDIKKTGVVYFGASLFLLLSLTVFSIVSVTAYIAL 1016

Qy 214 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKE 273 

1 I I I I I I I I II I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I II 
Db 1017 ALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKE 1076 

Qy 274 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHY 333

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1077 LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHY 1136 

Qy 334 LGLANKSVKDAMAKIQAKIPGLKRKAD 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I : 
Db 1137 LGLANKSVKDAMAKIQAKIPGLKRKAE 1163

RESULT 6 
Q96B16 



ID Q96B16 PRELIMINARY; PRT; 392 AA.
AC Q96B16;
DT 01-DEC-2001 (TrEMBLrel. 19, Created)
DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE Hypothetical protein (RTN4).
GN RTN4.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Kidney;
RA Strausberg R.;
RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RA Oertle T., Schwab M.E.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RA Van der Putten H.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP SEQUENCE FROM N.A.
RX MEDLINE=22376540; PubMed=12488097;
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL J. Mol. Biol. 325:299-323(2003).
DR EMBL; BC016165; AAH16165.1; -.
DR EMBL; AY102285; AAM64242.1; -.
DR EMBL; AY102278; AAM64247.1; -.
DR GO; GO:0005783; C:endoplasmic reticulum; IEA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Hypothetical protein.
SQ SEQUENCE 392 AA; 42274 MW; D7B2AA5E839E58AD CRC64;

Query Match 87.7%; Score 1598.4; DB 4; Length 392; 

Best Local Similarity 82.7%; Pred. No. 2.3e-58; 

Matches 326; Conservative 12; Mismatches 20; Indels 36; Gaps 6 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

111:111 I I I I I 1 I I I I I 1 I I I I I I I I I I I I I I I I I I : II I I M I I I I M I I I 
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

Qy 61 PAAGLSAAAVP — PAAAAPLLDFSSDSVP PAPRGPLPAAPPAAPERQPSWERSPAA P 115 

I 1 I I I I I I II III llhll : I I II I I I II I I I M I I I I I I I I I : II : I 
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

III I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 



Qy 167 RGS GS S WDLL YWRDI KKTGWFGASLFLLLS LTVFS I VS 206 

III I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 179 RGSSGSVDETLFALPAASEPVIRSSAVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVS 238

Qy 207 VTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGH 266 

I I I I I I I I I I I I t II I I I I I I I II I I I I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I 

Db 239 VTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGH 298

Qy 267 VNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERH 326

II II I I I I I I II II I M I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I : I I I I I I I 

Db 299 VNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERH 358 

Qy 327 QVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360 

I I I I I I I I I I I I : I I I I I I I I I I I I I II I I I I : 
Db 359 QAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 392 



RESULT 7 
Q8IUA4 



ID Q8IUA4 PRELIMINARY; PRT; 986 AA.
AC Q8IUA4;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update)
DE RNT4 (RTN4).
GN RTN4.
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX NCBI_TaxID=9606;
RN [1]
RP SEQUENCE FROM N.A.
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RA Oertle T., Schwab M.E.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RA Van der Putten H.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RC TISSUE=Testis;
RX MEDLINE=22376540; PubMed=12488097;
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL J. Mol. Biol. 325:299-323(2003).
DR EMBL; AY102285; AAM64244.1;
DR EMBL; AY123245; AAM64249.1;
DR EMBL; AY123246; AAM64250.1; -.
DR EMBL; AY123247; AAM64251.1; -.
DR EMBL; AY123248; AAM64252.1;
DR EMBL; AY123249; AAM64253.1;
DR EMBL; AY123250; AAM64254.1;
DR GO; GO:0005783; C:endoplasmic reticulum; IEA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 986 AA; 108449 MW; 0CDE8F647036415A CRC64;

Query Match 55.5%; Score 1012.1; DB 4; Length 986; 

Best Local Similarity 27.7%; Pred. No. 7.3e-33; 

Matches 250; Conservative 33; Mismatches 71; Indels 549; Gaps 16; 

Qy 3 DIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDE EDEEE 40

: : : I : I I : I II I I : I : I I I I 

Db 88 ELEYSEMGSSFSVS PKAESAVIVANPREEI IVKNKDEEEKLVSNNILHNQQELPTA 143 

Qy 41 EEDE EEDDED 50 

: I I I : : I I 

Db 144 LTKLVKEDEVVS SEKAKDS FNEKRVAVEAPMREEYADFKPFERWEVKDS KEDS DMLAAG 203 

Qy 51 LEEL EVLERKPAAGLSAAAVPP 72 

II: I :: : I :: I | 

Db 204 GKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNP 263 

Qy 73 AA AAPLLDFSSDS 85 

II : I I : I I 

Db 264 AATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDY 323 

Qy 86 VP PAP RG P L P AAP P AAP E RQ 105 

I III II: 

Db 324 VTTDNLTKVTEEVVANMPEGLTPDLVQEACESELNEATGTKIAYETKMDLVQTSEVMQES 383

Qy 106 PSWERSPAAPAPSLPP AAAVLPSKLP 131 

I I : I I I I : I I I I : : I I I 

Db 384 LYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESI 443 

Qy 132 — EDDEPP A 138 

I : I I I 
Db 444 KHEPENPPPYEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSA 503 

Qy 139 RPPP- PPPAGASPLAEPAAPPSTPA APKRR 167 

II I I I I :: I I I I : : : 

Db 504 EPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKE 563 

Qy 168 167 

Db 564 SLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIP 623 

Qy 168 167 

Db 624 LQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREY 683 

Qy 168 GSGS 171 

I : I I 

Db 684 TDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVL 743 



Qy 172 SWDLL 177 

I II 1 I I 

Db 744 LLPPDVSALATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLL 803

Qy 178 YWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDE 237

II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I II I I II I I I I I I I I I 
Db 804 YWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDE 863

Qy 238 GHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFT 297 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 864 GHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFT 923

Qy 298 YVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKR 357

I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : II I I I I I I I I I I II I I I 
Db 924 YVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKR 983

Qy 358 KAD 360 

I I : 

Db 984 KAE 986 



RESULT 8 
Q8BGK7 
ID Q8BGK7 PRELIMINARY; PRT; 1046 AA.
AC Q8BGK7;
DT 01-MAR-2003 (TrEMBLrel. 23, Created)
DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE RTN4.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7, and 129SvcJ7;
RA Oertle T., van der Putten H., Schwab M.E.;
RT "Genomic Structure and Functional Characterization of the Promoter
RT Structures of Human and Mouse Nogo/Rtn-4.";
RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7, and 129SvcJ7;
RA Oertle T., Schwab M.E.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP SEQUENCE FROM N.A.
RC STRAIN=129/SvcJ7;
RA Van der Putten H.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
RN [4]
RP SEQUENCE FROM N.A.
RC STRAIN=129SvcJ7;
RA Van der Putten H., Mir A.;
RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AY102280; AAM73502.1; -.
DR EMBL; AY102286; AAM73507.1; -.
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
SQ SEQUENCE 1046 AA; 114221 MW; 8CE2E2238ED51222 CRC64;

Query Match 55.4%; Score 1010.2; DB 11; Length 1046; 

Best Local Similarity 29.2%; Pred. No. 1e-32;

Matches 249; Conservative 25; Mismatches 80; Indels 498; Gaps 14; 

Qy 5 DQSSLVSSSTDSPPRPPPA FKYQFVTEPEDE 35 

I : I I I : I : I I I III 

Db 197 DKEDLVCSAALHNPQESPATLTKVVKEDGVMSPEKTMDIFNEMKMSVVAPVREEYADFKP 256

Qy 36 EDEEEEEDEEEDDED 50

II I :: : I I 

Db 257 FEQAWEVKDTYEGSRDVLAARANMESKVDKKCFEDSLEQKGHGKDSESRNENASFPRTPE 316 

Qy 51 LEELEVLERKPAAGLSAAAVPP 72 

: I : : I I I I : I 
Db 317 LVKDGSRAYITCDSFSSATESTAANIFPVLEDHTSENKTDEKKIEERK — AQIITEKTSP 374 

Qy 73 AAAAPLLDFSSDS VPPAPRGPLPAAPPAAPERQ 105 

: I I I I I I I I II: 

Db 375 KTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEACESELNEATGTKIA 434 

Qy 106 PSWERSPAAPAPSLP PAAAVLPSKLPEDD 134 

I I : I : I I : I I I I : : I I I 

Db 435 YETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVA 494 

Qy 135 EP PARPPP 142 

: I i I I I 

Db 495 QPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIKEPESFNAAAQEAEA 554 

Qy 143 PPPA GASPLAEPA A 156 

II : I I : I I : 

Db 555 PYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDS 614 

Qy 157 PPSTP AAPKRRG 168 

II I : I : I 

Db 615 IPEVPQTQEEAVMLMKESLTEVSETVTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA 674 

Qy 169 168 

Db 675 ASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTF 734 

Qy 169 SG 170

I I 

Db 735 VSAKDDSPKEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSFKNTYPKDEAHVSDEFSK 794 

Qy 171 170 

Db 795 SRSSVSKVPLLLPNVSALESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAEL 854



Qy 171 --SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 228

: I I 1 I I I I I I I I I I 1 I I I I M I I I I I I I 1 I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 855 NKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 914

Qy 229 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 288 

I I 1 I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I II I I I I I I 
Db 915 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLK 974 

Qy 289 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKI 348

I I I I I I I I I I I I II I I I I I I I II I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 975 FAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKI 1034

Qy 349 QAKIPGLKRKAD 360

I I I I I I I I I I I : 
Db 1035 QAKIPGLKRKAE 1046



RESULT 9 
Q80W95 

ID Q80W95 PRELIMINARY; PRT; 578 AA.
AC Q80W95;
DT 01-JUN-2003 (TrEMBLrel. 24, Created)
DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Nogo-A (Fragment).
GN NOGO-A.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Tozaki H., Hirata T.;
RT "The partial sequence of mouse nogo-A cDNA clone#4 109.";
RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AB073672; BAC75974.1; -.
DR GO; GO:0005783; C:endoplasmic reticulum; IEA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
FT NON_TER 1 1
SQ SEQUENCE 578 AA; 63696 MW; 832670C171E4AC61 CRC64;

Query Match 54.4%; Score 992.2; DB 11; Length 578; 

Best Local Similarity 42.4%; Pred. No. 1.2e-32; 

Matches 233; Conservative 13; Mismatches 79; Indels 225; Gaps 8; 

Qy 17 PPRPPP AFKYQFVTEPEDEEDEEEEEDEEEDDE DLEELEVLERKPAA 63 

I I I I I I I I : I I I II: I : I : 

Db 48 PENPPPYEEAMSVALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSP 107 

Qy 64 GLS AAAVP P AAAAP L L D F S S D S VP PAP R 91 

II : I I : | : | | M : I I : 

Db 108 GFSNYSEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTE 167 

Qy 92 GPLPAAPPAAPERQP 106 

I I II 



Db 168 VSETVTQHKHKERLSASPQEVGKPYLESFQPNLHITKDAASNEIPTLTKKETISLQMEEF 227



Qy 107 106 

Db 228 NTAIYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDSPKEYTDLEVSNKSE 287 

Qy 107 SWERSPAAPAPSLPPAAAVLPS- 128

III 111:11 
Db 288 IANVQSGANSLPCSELPCDLSFKNTYPKDEAHVSDEFSKSRSSVFKVPLLLPNVSALESQ 347 

Qy 129 KLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSG 170 

I I I I I I II : 

Db 348 IEMGNIVKPKVLTKEAEEKLPSDTEKEDR SLTAVLSAELNK 388

Qy 171 SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 230

: I I I I I I II I I I I I I I I I I I I I I I t I I I II II I II I I I I I I I I I I I M I I I I I I I I I I I I 
Db 389 TSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 448

Qy 231 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFA 290 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I i M I I 
Db 449 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFA 508 

Qy 291 VLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQA 350

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I II I I I I I I I 
Db 509 VLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQA 568

Qy 351 KIPGLKRKAD 360 

I I I I I I I II : 
Db 569 KIPGLKRKAE 578 



RESULT 10
Q8K290

ID Q8K290 PRELIMINARY; PRT; 639 AA.
AC Q8K290;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE Hypothetical protein.
GN RTN4.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RA Strausberg R.;
RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC032192; AAH32192.1;
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Hypothetical protein.
SQ SEQUENCE 639 AA; 70312 MW; 309A19DA37603F11 CRC64;



Query Match 54.2%; Score 988.7; DB 11; Length 639; 

Best Local Similarity 42.2%; Pred. No. 2.2e-32;

Matches 232; Conservative 14; Mismatches 79; Indels 225; Gaps 8; 



Qy 17 PPRPPP AFKYQFVTEPEDEEDEEEEEDEEEDDE DLEELEVLERKPAA 63

I I I I I I 11:11 I II: I : I :

Db 109 PENPPPYEEAMSVALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSP 168

Qy 64 GLS- AAAVP PAAAAPLLDFSSDSVPPAPR 91

I : I I : I : I I I I : I I :

Db 169 EFSNYSEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTE 228

Qy 92 GPLPAAPPAAPERQP 106

I I II

Db 229 VSETVTQHKHKERLSASPQEVGKPYLESFQPNLHITKDAASNEIPTLTKKETISLQMEEF 288

Qy 107 106

Db 289 NTAIYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDSPKEYTDLEVSNKSE 348

Qy 107 SWERSPAAPAPSLPPAAAVLPS- 128

I II : III : I I
Db 349 IANVQSGANSLPCSELPCDLSFKNTYPKDEAHVSDEFSKSRSSVSKVPLLLPNVSALESQ 408

Qy 129 KLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSG 170

I I I I I I II :

Db 409 IEMGNIVKPKVLTKEAEEKLPSDTEKEDR SLTAVLSAELNK 449

Qy 171 SSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 230

: I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I i I I I I II I II I I I I I I
Db 450 TSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 509

Qy 231 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFA 290

I I I I I I I I I I I I II II II I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I
Db 510 AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFA 569

Qy 291 VLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQA 350

I I I I I I I I I II i II I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I II I I
Db 570 VLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQA 629

Qy 351 KIPGLKRKAD 360
I I I I I I II I :
Db 630 KIPGLKRKAE 639



RESULT 11
Q7YRW9
ID Q7YRW9 PRELIMINARY; PRT; 184 AA.
AC Q7YRW9;
DT 01-OCT-2003 (TrEMBLrel. 25, Created)
DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update)
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update)
DE RTN4w (Fragment).
GN RTN4.
OS Bos taurus (Bovine).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;



OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia ; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=12832288;

RA Oertle T., Klinger M., Stuermer C.A., Schwab M.E.;

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164744; AAP47319.1; 

FT NON_TER 1 1 

SQ SEQUENCE 184 AA; 20671 MW; DE990E03BBAF84A1 CRC64; 

Query Match 48.2%; Score 878; DB 6; Length 184; 

Best Local Similarity 96.2%; Pred. No. 3.6e-29; 

Matches 177; Conservative 3; Mismatches 4; Indels 0; Gaps 0 

Qy  177 LYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSD 236
        ||||||||||||||  ||||||||||||||||||||||||||||||||||||||||||||
Db    1 LYWRDIKKTGVVFGXXLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSD 60

Qy  237 EGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVF 296
        |||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
Db   61 EGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVF 120

Qy  297 TYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLK 356
        ||||||||||||||||||||||:|||||||| |||||||||||:||||||||||||||||
Db  121 TYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 180

Qy  357 RKAD 360
        |||:
Db  181 RKAE 184
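
The "Best Local Similarity" percentages in this report are consistent with the number of identical positions divided by the total number of alignment columns (matches + conservative + mismatches + indels). That relationship is inferred from the figures printed here and is not a documented GenCore definition. A minimal Python sketch under that assumption (the function name is hypothetical):

```python
def best_local_similarity(matches, conservative, mismatches, indels):
    """Percent identity over all alignment columns.

    Assumption: every conservative substitution, mismatch and indel
    position counts as one alignment column alongside the matches.
    """
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# Counts copied from two hits in this report.
q7yrw9 = best_local_similarity(177, 3, 4, 0)     # RESULT 11
q7t224 = best_local_similarity(175, 11, 12, 30)  # RESULT 12
print(round(q7yrw9, 1), round(q7t224, 1))
```

Comparing the rounded values with the printed "Best Local Similarity" lines is a quick consistency check on OCR-damaged match counts.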



RESULT 12 
Q7T224 

ID Q7T224 PRELIMINARY; PRT; 199 AA. 

AC Q7T224; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4-C. 

GN RTN4 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=12832288;

RA Oertle T., Klinger M., Stuermer C.A., Schwab M.E.;

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164737; AAP47312.1; 



SQ SEQUENCE 199 AA; 22293 MW; 07CF4E4EF2723251 CRC64; 



Query Match 48.0%; Score 874.8; DB 13; Length 199; 

Best Local Similarity 76.8%; Pred. No. 6.1e-29; 

Matches 175; Conservative 11; Mismatches 12; Indels 30; Gaps 1 

Qy  133 DDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSVVDLLYWRDIKKTGVVFGAS 192
        | :|                               |    ||||||||||||||||||||
Db    2 DSQP------------------------------SGWKDKVVDLLYWRDIKKTGVVFGAS 31

Qy  193 LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAIS 252
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||:||:|
Db   32 LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESDVAVS 91

Qy  253 EELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 312
        |:|:||||:  |||:| |:|||||||||||||||||||||||||||||||||||||||||
Db   92 EDLIQKYSSVVLGHINGTVKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 151

Qy  313 LISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
        ||||||:|||||||| |||||||| ||:|||||||||||||||||| :
Db  152 LISLFSVPVIYERHQAQIDHYLGLVNKNVKDAMAKIQAKIPGLKRKTE 199



RESULT 13 
Q7TNB7 

ID Q7TNB7 PRELIMINARY; PRT; 720 AA. 

AC Q7TNB7; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,
RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,
RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human 



RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A.
RC STRAIN=C57BL/6; TISSUE=Brain;
RA Strausberg R.;
RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC056373; AAH56373.1; -.
KW Hypothetical protein.
SQ SEQUENCE 720 AA; 77435 MW; 80AB78728F16EAB2 CRC64;



Query Match 46.9%; Score 855.6; DB 11; Length 720; 

Best Local Similarity 46.9%; Pred. No. 1.1e-26;

Matches 210; Conservative 28; Mismatches 66; Indels 144; Gaps 18; 

Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I M I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEE-EEEDDEDLEELEVLERK 59 

Qy 61 PAAGLSAAAVPPAAAAPLLDFSSDSVP PAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 12 0 

I I I I I I I ! Ml I I I I I I I I I II I I I I I I I II I I I I I I I I I I II I I I I M I i I I I I I 
Db 60 PAAGLSAAPVPP-AAAPLLDFS S DSVPPAPRGPLPAAP PTAPERQPSWERS PAASAPS LP 118 

Qy 121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSWDLLYWR 180 

I I I I I I I I I II I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 119 PAAAVLPSKLPEDDEPPAR — P P APAGAS P LAE P AAP P S T PAAP KRRG S G S — VD 169 

Qy 181 DIKKTGWFGASLFLL LSLTVFSI 204 

: I I I III: 
Db 170 ETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASL 219 

Qy 205 VSVTAYIALALLSVTISFRI — YKGVIQAIQKSDEG HPF — R 242 

I I : I I I : I I : I I : I : I I I : I I I 

Db 220 PS LSPLS-TVSFKEHGYLGNLSAV-ASTEGTIEETLNEASRELPERATNPFVNR 271 

Qy 243 AYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLV DDLVDSLKF A 290 

II I : : : I I : : : : I : I I I I I 

Db 272 ESAEFSVLEYSEMGSSFNGSPKGESAMLVENTKEEVIVRSKDKEDLVCSAALHNPQESPA 331 

Qy 291 VLMWVFT YVGA LFNGLTLLILALI SLFSIPV 1 YE 324 

II I : | | : : :: | | | | | 

Db 332 TLTKWKEDGVMSPEKTMDIFNEMKMSWA PVREEYADFKPFEQAWEVKDTYE 384 

Qy 325 RHQVQI DHYLGLANKSVKDAMAKIQAKI 352 

: : I I I : : : I : 
Db 385 GSRDVLAARANMESKV 400 



RESULT 14 
Q9GM33 

ID Q9GM33 PRELIMINARY; PRT; 179 AA. 

AC Q9GM33; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 
DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE Hypothetical protein. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9541; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Osada N., Hida M., Kusuda J., Tanuma R., Iseki K., Hirai M., Terao K.,
RA Suzuki Y., Sugano S., Hashimoto K.;
RT "Isolation of full-length cDNA clones from macaque brain cDNA
RT libraries.";
RL Submitted (OCT-2000) to the EMBL/GenBank/DDBJ databases.
DR EMBL; AB049853; BAB16739.1;
DR GO; GO:0005783; C:endoplasmic reticulum; IEA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Hypothetical protein.
SQ SEQUENCE 179 AA; 19949 MW; 5F8CD4383FEE9E02 CRC64;



Query Match 46.3%; Score 844; DB 6; Length 179; 

Best Local Similarity 95.5%; Pred. No. 8.9e-28; 

Matches 171; Conservative 6; Mismatches 2; Indels 0; Gaps 0 

Qy  182 IKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 241
        :|||||||||||||||||||||||||||||||||||||||||||:|||||||||||||||
Db    1 MKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYRGVIQAIQKSDEGHPF 60

Qy  242 RAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGA 301
        ||||||||||||||||||||||||||| ||||||||||||||||||:|||||||||||||
Db   61 RAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLRFAVLMWVFTYVGA 120

Qy  302 LFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
        |||||||||||||||||:|||||||| |||||||||||:|||||||||||||||||||:
Db  121 LFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 179



RESULT 15 
Q8K0T0 

ID Q8K0T0 PRELIMINARY; PRT; 780 AA. 

AC Q8K0T0; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Similar to reticulon 1 (Hypothetical protein).

GN RTN1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Retina; 

RA Strausberg R.;

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP SEQUENCE FROM N.A. 



RC TISSUE=Olfactory epithelium;
RX MEDLINE=22388257; PubMed=12477932;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S.,
RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E.,
RA Jones S.J., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Olfactory epithelium;
RA Strausberg R.;
RL Submitted (JUN-2003) to the EMBL/GenBank/DDBJ databases.
DR EMBL; BC030455; AAH30455.1; -.
DR EMBL; BC053926; AAH53926.1; -.
DR MGD; MGI:1933947; Rtn1.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.

DR InterPro; IPR001951; Histone_H4. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HISTONE_H4; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 780 AA; 83572 MW; 29B47A58FC2F2027 CRC64; 



Query Match 45.3%; Score 826.2; DB 11; Length 780; 

Best Local Similarity 28.8%; Pred. No. 2.2e-25; 

Matches 186; Conservative 57; Mismatches 105; Indels 298; Gaps 12 

Qy 11 SSSTDSPPRPPP AFKY 26 

I I : I I I I : I 1 

Db 137 SESPEEPETPGPSLPEVPGMEPQGLLSSDSGIEMTPAESTEWKILADPLDQMKAEAYKY 196 



Qy 27 QFVTEPEDEEDEEEE EDEEED — DEDLE 52 

: I I :: : : M : II:: I I : I I 

Db 197 IDITRPQEAKGQEEQHPGLEDKDLDFKDKDTEVSTKAEGVRAPNQPAPVEGKLIKDHLFE 256 

Qy 53 52 

Db 257 ESTFAPYIDELSDEQHRVSLVTAPVKITLTEIEPPLMTATQETIPEKQDLCLKPSPDTVP 316 



Qy 



53 



52 



Db 317 TVTVSEPEDDSPGSVTPPSSGTEPSAAESQGKGSVSEDELIAAIKEAKGLSYETTESPRP 37 6 

Qy 53 ELEVLERKPAAGLSA 67 

I : I : : II I 

Db 377 VGQVADKPKTKTRSGLPTIPSPLDQEASSAESGDSEIELVSEDPMASEDALPSGYVSFGH 436 

Qy 68 -AAVPPAAAAPLLDFS SDSVPPAPRGPL 94 

: 11= hi = =1 II II I I 

Db 437 VSGPPPSPASPSIQYSILREEREAELDSELIIESCDASSASEESPKREQDS-PPMKPGAL 495 

QY 95 PA AP PAAPERQPSWERS PAAPAP — SLPPAAAVLPSKLPEDDEPPARPPPPPPAG 147 

I I I I I I : I I I I I I I I : I : : | | | 

Db 496 DAI REETGSRATEERAPS-HQGPVEPDPMLSFAPAAALQSRPEPS SGDGASVPEPPRS QQ 554 

QY 148 ASPLAEPAAPPSTPAA PKRRGSG SSWDLLYWRDIKKTGWFGASLF 194 

I i : : I I I III : I I I I I I I I I : I I : I I I : I 

Db 555 QKPEEEAVSSSQSPTATEIPGPLGSGLMPPLPFFNKQKAIDLLYWRDIKQTGIVFGSFLL 614 

Qy 195 LLLSLTVFSIVSVTAYIALALLSWISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 254 

M Ml I I : I M I I : I I I II I I I I I I I I I : M : I I : I I I I I I : I I I I I : : I : I 
Db 615 LLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQE 674 

Qy 255 LVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALI 314 

: I I I : : : I I I I : I I I I I I I I I I I I I I I M I M I I : I I I I I I I I I I I I I :: I : : 
Db 675 QIQKYTDCLQLYWSTLKELRRLFLVQDLVDSLKFAVLMWLLTWGALFNGLTLLLMAW 734 

Qy 315 SLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360 

I : I :: I 1 : I : I I I : I III! : : I I I I I I I M II I : 

Db 735 SMFTLPWYVKHQAQVDQYLGLVRTHINTWAKIQAKIPGAKRHAE 780 

Search completed: September 29, 2004, 18:54:39 
Job time : 81.3939 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on:         September 29, 2004, 18:21:11 ; Search time 14.9792 Seconds
                (without alignments)
                1251.419 Million cell updates/sec

Title:          SEQ2_1-171_975-1163
Perfect score:  1823
Sequence:       1 MEDIDQSSLVSSSTDSPPRP..........VKDAMAKIQAKIPGLKRKAD 360

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.1

Searched:       141681 seqs, 52070155 residues

Total number of hits satisfying chosen parameters: 141681

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      SwissProt_42:*
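
The throughput figure in this header is consistent with counting one dynamic-programming cell update per (query residue, database residue) pair and dividing by the search time. That reading is an assumption inferred from the numbers above, not a statement taken from GenCore documentation. A short Python sketch (the function name is hypothetical):

```python
def cell_updates_per_sec(query_len, db_residues, search_seconds):
    """Smith-Waterman matrix cells filled per second.

    Assumption: one cell update per query-residue/database-residue pair.
    """
    return query_len * db_residues / search_seconds


# Values from this run: 360-residue query, 52070155 database residues,
# 14.9792 seconds of search time.
rate = cell_updates_per_sec(360, 52070155, 14.9792)
print(f"{rate / 1e6:.3f} Million cell updates/sec")
```

Under the stated assumption the result agrees with the figure printed in the header to within rounding.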



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
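
The "Query Match" percentages reported for each hit are consistent with the hit score expressed as a fraction of the perfect score (1823 for this query). This is inferred from the values in the report, not from a documented definition. A minimal Python sketch under that assumption (the function name is hypothetical):

```python
PERFECT_SCORE = 1823  # "Perfect score" from the run header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the query's perfect (self) score."""
    return 100.0 * score / perfect_score


# Scores copied from the summary table of this search.
for hit_id, score in [("RTN4_RAT", 1732.7), ("RTN4_HUMAN", 1518.4), ("RTN4_MOUSE", 914)]:
    print(hit_id, round(query_match_percent(score), 1))
```

Because the ratio can be recomputed from the score alone, it is useful for repairing Query Match values whose digits were split apart by text extraction.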

SUMMARIES 



Result           Query
   No.    Score  Match  Length  DB  ID           Description
     1   1732.7   95.0    1163   1  RTN4_RAT     Q9jk11 rattus norv
     2   1518.4   83.3    1192   1  RTN4_HUMAN   Q9nqc3 homo sapien
     3      914   50.1     199   1  RTN4_MOUSE   Q99p72 mus musculu
     4    826.3   45.3     776   1  RTN1_HUMAN   Q16799 homo sapien
     5    807.4   44.3     777   1  RTN1_RAT     Q64548 rattus norv
     6    659.9   36.2     237   1  RTN3_MOUSE   Q9es97 mus musculu
     7    654.6   35.9     236   1  RTN3_HUMAN   O95197 homo sapien
     8    585.3   32.1     545   1  RTN2_HUMAN   O75298 homo sapien
     9    555.3   30.5     471   1  RTN2_MOUSE   O70622 mus musculu
    10    272.1   14.9     786   1  PRGR_CHICK   P07812 gallus gall
    11      259   14.2    1059   1  CAPU_DROME   Q24120 drosophila
    12    257.6   14.1    2715   1  MLL4_HUMAN   Q9umn6 homo sapien
    13    257.5   14.1    1206   1  FM14_MOUSE   Q05859 mus musculu
    14    256.4   14.1    3530   1  MY15_HUMAN   Q9ukn7 homo sapien
    15    254.9   14.0    1468   1  FMN1_MOUSE   Q05860 mus musculu
    16    253.9   13.9    1790   1  SEPA_EMENI   P78621 emericella
    17    253.7   13.9    1157   1  BBC1_YEAST   P47068 saccharomyc
    18    253.2   13.9    1255   1  DIA1_MOUSE   O08808 mus musculu
    19    252.5   13.9    1248   1  DIA1_HUMAN   O60610 homo sapien
    20    248.6   13.6     555   1  GP1_CHLRE    Q9fpq6 chlamydomon
    21    246.4   13.5    1657   1  STNB_CAEEL   P90761 caenorhabdi
    22      246   13.5    1522   1  PST1_SCHPO   Q09750 schizosacch
    23    245.2   13.5    3511   1  MY15_MOUSE   Q9qzz4 mus musculu
    24    244.7   13.4     465   1  FXD1_HUMAN   Q16676 homo sapien
    25    242.7   13.3    1533   1  Y586_HUMAN   Q9bvv6 homo sapien
    26    242.1   13.3    1375   1  BNR1_YEAST   P40450 saccharomyc
    27    241.1   13.2     775   1  ICP0_HSV11   P08393 herpes simp
    28    240.9   13.2     980   1  FTKL_DEIRA   Q9rxb5 deinococcus
    29    240.9   13.2    1427   1  ZFH2_HUMAN   Q9c0a1 homo sapien
    30    240.6   13.2     933   1  PRGR_HUMAN   P06401 homo sapien
    31    240.5   13.2    1446   1  IE18_PRVKA   P33479 pseudorabie
    32    240.2   13.2    1164   1  FHOS_HUMAN   Q9y613 homo sapien
    33    238.2   13.1    2517   1  NCR2_HUMAN   Q9y618 h nuclear r
    34      238   13.1    3149   1  TEGU_EBV     P03186 epstein-bar
    35    237.5   13.0     830   1  JIP2_MOUSE   Q9ere9 mus musculu
    36    237.1   13.0     902   1  IF2_BRAJA    Q89wa9 bradyrhizob
    37    236.6   13.0    2167   1  SHK1_RAT     Q9wv48 rattus norv
    38    236.5   13.0    1395   1  IF4G_HUMAN   Q04637 homo sapien
    39    236.3   13.0    1402   1  IF4G_RABIT   P41110 oryctolagus
    40    236.3   13.0    2142   1  BAT2_HUMAN   P48634 homo sapien
    41      236   12.9    1132   1  BAT3_HUMAN   P46379 homo sapien
    42      236   12.9    1332   1  M4K6_HUMAN   Q8n4c8 homo sapien
    43    235.5   12.9     925   1  IF2_GLOVI    Q7nh85 gloeobacter
    44    235.5   12.9    1461   1  IE18_PRVIF   P11675 pseudorabie
    45    234.9   12.9     824   1  JIP2_HUMAN   Q13387 homo sapien



ALIGNMENTS 



RESULT 1 
RTN4_RAT 

ID RTN4_RAT STANDARD; PRT; 1163 AA. 

AC Q9JK11; Q9JK10; Q9R0D9; Q9WUE9; Q9WUF0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Glut4 vesicle 20 kDa protein) . 

GN RTN4 OR NOGO. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND PARTIAL SEQUENCE. 

RC STRAIN=Sprague-Dawley; TISSUE=Adipocyte ; 

RX MEDLINE=99249816; PubMed=10231557 ; 

RA Morris N.J., Ross S.A., Neveu J.M., Lane W.S., Lienhard G.E.; 

RT "Cloning and characterization of a 22 kDa protein from rat adipocytes: 

RT a new member of the reticulon family."; 

RL Biochim. Biophys. Acta 1450:68-76(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3).
RX MEDLINE=20129258; PubMed=10667796;
RA Chen M.S., Huber A.B., Van der Haar M.E., Frank M., Schnell L.,
RA Spillmann A.A., Christ F., Schwab M.E.;
RT "Nogo-A is a myelin-associated neurite outgrowth inhibitor and an
RT antigen for monoclonal antibody IN-1.";
RL Nature 403:434-439(2000).

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 4) . 

RC STRAIN=Wistar Kyoto; TISSUE=Vascular smooth muscle;

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in rat: one of two 

RT minor splice variants."; 

RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP FUNCTION. 

RX MEDLINE=22033691; PubMed=12037567;

RA GrandPre T., Li S., Strittmatter S.M.;

RT "Nogo-66 receptor antagonist peptide promotes axonal regeneration.";

RL Nature 417:547-551(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 

CC transmembrane domains (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=4;
CC Name=1; Synonyms=Nogo-A, NI-220-250;
CC IsoId=Q9JK11-1; Sequence=Displayed;
CC Name=2; Synonyms=Nogo-B, Foocen-M1;
CC IsoId=Q9JK11-2; Sequence=VSP_005658;
CC Name=3; Synonyms=Nogo-C, VP20;
CC IsoId=Q9JK11-3; Sequence=VSP_005656, VSP_005657;
CC Name=4; Synonyms=Foocen-M2;
CC IsoId=Q9JK11-4; Sequence=VSP_005659;

CC -!- TISSUE SPECIFICITY: Isoforms 1, 2 and 3 are present in optic 
CC nerve, spinal cord and cerebral cortex. Isoforms 1 and 2 are 

CC present in dorsal root ganglion, sciatic nerve and PC12 cells 

CC after longer exposure. Isoforms 2 and 3 are detected in kidney, 

CC cartilage, skin, lung and spleen. Isoform 3 is expressed at high 

CC level in skeletal muscle. In adult animals isoform 1 is expressed 

CC mainly in the nervous system. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF051335; AAF01564.1; 

DR EMBL; AJ242961; CAB71027.1; -. 

DR EMBL; AJ242962; CAB71028.1; -. 



DR EMBL; AJ242963; CAB71029.1;
DR EMBL; AF132045; AAD31019.1;
DR EMBL; AF132046; AAD31020.1;
DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane;
DR GO; GO:0005635; C:nuclear membrane; ISS.
DR GO; GO:0005515; F:protein binding; ISS.
DR GO; GO:0019987; P:negative regulation of anti-apoptosis; ISS.
DR GO; GO:0030517; P:negative regulation of axon extension; ISS.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Endoplasmic reticulum; Alternative splicing; Transmembrane.


FT DOMAIN        1    989  CYTOPLASMIC (Potential).
FT TRANSMEM    990   1010  POTENTIAL.
FT DOMAIN     1011   1104  LUMENAL (Potential).
FT TRANSMEM   1105   1125  POTENTIAL.
FT DOMAIN     1126   1163  CYTOPLASMIC (Potential).
FT DOMAIN      976   1163  RETICULON.
FT DOMAIN       33     46  POLY-GLU.
FT DOMAIN       73     76  POLY-ALA.
FT DOMAIN      140    145  POLY-PRO.
FT VARSPLIC      1    964  Missing (in isoform 3).
FT                         /FTId=VSP_005656.
FT VARSPLIC    965    975  AVLSAELSKTS -> MDGQKKHWKDK (in isoform
FT                         3).
FT                         /FTId=VSP_005657.
FT VARSPLIC    173    975  Missing (in isoform 2).
FT                         /FTId=VSP_005658.
FT VARSPLIC    192    975  Missing (in isoform 4).
FT                         /FTId=VSP_005659.
FT CONFLICT   1130   1131  MISSING (IN REF. 3; AAD31020).
SQ SEQUENCE 1163 AA; 126386 MW; 8CB894B09E94F0B6 CRC64;



Query Match 95.0%; Score 1732.7; DB 1; Length 1163;
Best Local Similarity 31.0%; Pred. No. 3.2e-50;
Matches 360; Conservative 0; Mismatches 0; Indels 803; Gaps 1;



Qy    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60

Qy   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 PAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAAPAPSLP 120

Qy  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGS--------- 171
        |||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 PAAAVLPSKLPEDDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSVDETLFALP 180

Qy  172 ------------------------------------------------------------ 171
Db  181 AASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSLSPLSTVSFKEHGYL 240

Qy  172 ------------------------------------------------------------ 171
Db  241 GNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYSEMGSSFKGSPKGES 300

Qy  172 ------------------------------------------------------------ 171
Db  301 AILVENTKEEVIVRSKDKEDLVCSAALHSPQESPVGKEDRVVSPEKTMDIFNEMQMSVVA 360

Qy  172 ------------------------------------------------------------ 171
Db  361 PVREEYADFKPFEQAWEVKDTYEGSRDVLAARANVESKVDRKCLEDSLEQKSLGKDSEGR 420

Qy  172 ------------------------------------------------------------ 171
Db  421 NEDASFPSTPEPVKDSSRAYITCASFTSATESTTANTFPLLEDHTSENKTDEKKIEERKA 480

Qy  172 ------------------------------------------------------------ 171
Db  481 QIITEKTSPKTSNPFLVAVQDSEADYVTTDTLSKVTEAAVSNMPEGLTPDLVQEACESEL 540

Qy  172 ------------------------------------------------------------ 171
Db  541 NEATGTKIAYETKVDLVQTSEAIQESLYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSL 600

Qy  172 ------------------------------------------------------------ 171
Db  601 LPSAGASVVQPSVSPLEAPPPVSYDSIKLEPENPPPYEEAMNVALKALGTKEGIKEPESF 660

Qy  172 ------------------------------------------------------------ 171
Db  661 NAAVQETEAPYISIACDLIKETKLSTEPSPDFSNYSEIAKFEKSVPEHAELVEDSSPESE 720

Qy  172 ------------------------------------------------------------ 171
Db  721 PVDLFSDDSIPEVPQTQEEAVMLMKESLTEVSETVAQHKEERLSASPQELGKPYLESFQP 780

Qy  172 ------------------------------------------------------------ 171
Db  781 NLHSTKDAASNDIPTLTKKEKISLQMEEFNTAIYSNDDLLSSKEDKIKESETFSDSSPIE 840

Qy  172 ------------------------------------------------------------ 171
Db  841 IIDEFPTFVSAKDDSPKLAKEYTDLEVSDKSEIANIQSGADSLPCLELPCDLSFKNIYPK 900

Qy  172 ------------------------------------------------------------ 171
Db  901 DEVHVSDEFSENRSSVSKASISPSNVSALEPQTEMGSIVKSKSLTKEAEKKLPSDTEKED 960

Qy  172 --------------SVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 217
                      ||||||||||||||||||||||||||||||||||||||||||||||
Db  961 RSLSAVLSAELSKTSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLS 1020

Qy  218 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 277
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 VTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL 1080

Qy  278 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 337
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 FLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLA 1140

Qy  338 NKSVKDAMAKIQAKIPGLKRKAD 360
        |||||||||||||||||||||||
Db 1141 NKSVKDAMAKIQAKIPGLKRKAD 1163



RESULT 2 
RTN4_HUMAN 

ID RTN4_HUMAN STANDARD; PRT; 1192 AA. 

AC Q9NQC3; 094962; Q9BXG5; Q9H212; Q9H3I3; Q9UQ42; Q9Y293; Q9Y2Y7; 

AC Q9Y5U6; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 
DE (Neuroendocrine-specific protein) (NSP) (Neuroendocrine specific 

DE protein C homolog) (RTN-x) (Reticulon 5) (My043 protein) . 

GN RTN4 OR NOGO OR ASY OR KIAA0886. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE=20129242; PubMed=10667780;

RA Prinjha R., Moore S.E., Vinson M., Blake S., Morrow R., Christie

RA Michalovich D., Simmons D.L., Walsh F.S.;

RT "Inhibitor of neurite outgrowth in humans."; 

RL Nature 403:383-384(2000). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2). 

RC TISSUE=Brain; 

RX MEDLINE=21010696; PubMed=11126360;

RA Tagami S., Eguchi Y., Kinoshita M., Takeda M., Tsujimoto Y.;

RT "A novel protein, RTN-XS, interacts with both Bcl-XL and Bcl-2 on

RT endoplasmic reticulum and reduces their anti-apoptotic activity.";

RL Oncogene 19:5736-5746(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE=20237542; PubMed=10773680; 

RA Yang J., Yu L., Bi A.D., Zhao S.-Y.;

RT "Assignment of the human reticulon 4 gene (RTN4) to chromosome

RT 2p14-->2p13 by radiation hybrid mapping.";

RL Cytogenet. Cell Genet. 88:101-102(2000). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 4). 

RA Jin W.-L., Ju G.;

RT "Developmentally-regulated alternative splicing in a novel Nogo-A

RL Submitted (NOV-2000) to the EMBL/GenBank/DDBJ databases.

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE=Placenta, and Skeletal muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in human."; 

RL Submitted (FEB-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Fibroblast; 

RA Yutsudo M. ; 



RT "Isolation of a cell death-inducing gene."; 

RL Submitted (JUN-1998) to the EMBL/GenBank/DDBJ databases.

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RC TISSUE=Pituitary; 

RA Song H., Peng Y., Zhou J., Huang Q., Dai M., Mao Y.M., Yu Y., Xu X.,

RA Luo B., Hu R., Chen J.;

RT "Human neuroendocrine-specific protein C (NSP) homolog gene.";

RL Submitted (JUL-1998) to the EMBL/GenBank/DDBJ databases.

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P., 

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y.,

RA Yu J., Han L.H.;

RT "Novel human cDNA clone with function of inhibiting cancer cell

RT growth.";

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases.

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Brain; 

RX MEDLINE=99156230; PubMed=10048485 ; 

RA Nagase T., Ishikawa K.-I., Suyama M. , Kikuno R. , Hirosawa M. , 

RA Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.; 

RT "Prediction of the coding sequences of unidentified human genes. XII. 

RT The complete sequences of 100 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 5:355-364(1998). 

RN [10] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3). 

RC TISSUE=Brain, Ovary, Pancreas, Placenta, and Skeletal muscle; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [11]

RP SEQUENCE FROM N.A. (ISOFORM 3).

RX MEDLINE=20499367; PubMed=11042152;

RA Zhang Q.-H., Ye M., Wu X.-Y., Ren S.-X., Zhao M., Zhao C.-J., Fu G.,

RA Shen Y., Fan H.-Y., Lu G., Zhong M., Xu X.-R., Han Z.-G., Zhang J.-W.,

RA Tao J., Huang Q.-H., Zhou J., Hu G.-X., Gu J., Chen S.-J., Chen Z.;

RT "Cloning and functional analysis of cDNAs with open reading frames for

RT 300 previously undefined genes expressed in CD34+ hematopoietic

RT stem/progenitor cells.";

RL Genome Res. 10:1546-1560(2000). 

RN [12] 

RP SEQUENCE OF 482-1192 FROM N.A. (ISOFORM 1/4). 

RC TISSUE=Brain; 

RA Mao Y.M., Xie Y., Zheng Z.H.; 

RL Submitted (MAY-1998) to the EMBL/GenBank/DDBJ databases.

RN [13] 

RP SEQUENCE OF 186-1192 FROM N.A. (ISOFORM 1). 

RC TISSUE=Testis; 

RA Sha J.H., Zhou Z.M., Li J.M.;

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases.

RN [14] 

RP TOPOLOGY. 

RC TISSUE=Brain; 

RX MEDLINE=20129259; PubMed=10667797;

RA GrandPre T., Nakamura F., Vartanian T., Strittmatter S.M.;

RT "Identification of the Nogo inhibitor of axon regeneration as a 

RT Reticulon protein."; 

RL Nature 403:439-444(2000). 

RN [15] 

RP FUNCTION. 

RC TISSUE=Brain; 

RX MEDLINE=21069055; PubMed=11201742;

RA Fournier A.E., Grandpre T., Strittmatter S.M.; 

RT "Identification of a receptor mediating Nogo-66 inhibition of axonal 

RT regeneration."; 

RL Nature 409:341-346(2001).

RN [16] 

RP REVIEW. 

RX MEDLINE=21888956; PubMed=11891768;

RA Ng C.E.L., Tang B.L.; 

RT "Nogos and the Nogo-66 receptor: factors inhibiting CNS neuron 

RT regeneration."; 

RL J. Neurosci. Res. 67:559-565(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults. 

CC Isoform 2 reduces the anti-apoptotic activity of Bcl-xl and Bcl-2.

CC This is likely consecutive to their change in subcellular 

CC location, from the mitochondria to the endoplasmic reticulum, 

CC after binding and sequestration. 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum. Anchored to the membrane of the endoplasmic reticulum 

CC through 2 putative transmembrane domains. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=1; Synonyms=RTN 4A, Nogo-A, RTN-xL;

CC IsoId=Q9NQC3-1; Sequence=Displayed;

CC Name=2; Synonyms=RTN 4B, Nogo-B, RTN-xS, Foocen-M;

CC IsoId=Q9NQC3-2; Sequence=VSP_005655;

CC Name=3; Synonyms=RTN 4C, Nogo-C, Foocen-S;

CC IsoId=Q9NQC3-3; Sequence=VSP_005652, VSP_005653;

CC Name=4;

CC IsoId=Q9NQC3-4; Sequence=VSP_005654;



CC -!- TISSUE SPECIFICITY: Isoform 1 is specifically expressed in brain
CC and testis and weakly in heart and skeletal muscle. Isoform 2 is
CC widely expressed excepted for the liver. Isoform 3 is expressed in
CC brain, skeletal muscle and adipocytes. Isoform 4 is testis-
CC specific.
CC -!- SIMILARITY: Contains 1 reticulon domain.
CC -!- CAUTION: Ref.11 sequence differs from that shown due to
CC frameshifts in positions 1149 and 1156.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AJ251383; CAB99248.1;
DR EMBL; AJ251384; CAB99249.1;
DR EMBL; AJ251385; CAB99250.1;
DR EMBL; AB040462; BAB18927;
DR EMBL; AB040463; BAB18928;
DR EMBL; AF148537; AAG12176;
DR EMBL; AF148538; AAG12177;
DR EMBL; AF087901; AAG12205;
DR EMBL; AF320999; AAG40878;
DR EMBL; AF132047; AAD31021.1;
DR EMBL; AF132048; AAD31022.1;
DR EMBL; AB015639; BAA83712.1;
DR EMBL; AF077050; AAD27783.1;
DR EMBL; AF177332; AAG17976.1;
DR EMBL; AB020693; BAA74909.1;
DR EMBL; BC001035; AAH01035.1;
DR EMBL; BC007109; AAH07109.1;
DR EMBL; BC014366; AAH14366.1;




Query Match 83.3%; Score 1518.4; DB 1; Length 1192; 

Best Local Similarity 27.3%; Pred. No. 5.6e-43; 

Matches 326; Conservative 12; Mismatches 20; Indels 836; Gaps 



Qy 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60
1 1 1 = 1 II 1 1 1 1 I I I I I | | 1 I M 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 : 1 1 :: I I I | II 1 1 1 1 1 1 1
Db 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58

Qy 61 PAAGLSAAAVP--PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA---P 115
N 1 1 1 1 M II III 1 1 1 : 1 1 : 1 1 M 1 1 1 1 II || I I | | | | || | | | : | | : 1
Db 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118

Qy 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE---------PAAPPSTPAAPKR 166
Ml 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 | | | 1 | | | | | | | || | | M
Db 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178

Qy 167 1 1 1 170

Db 179 RGSSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSL 238

Qy 171 170

Db 239 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 298 

Qy 171 170

Db 299 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDE 358

Qy 171 170

Db 359 VVSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLES 418 

Qy 171 170 

Db 419 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 478 

Qy 171 170 

Db 479 IFPLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKV 538 

Qy 171 170 

Db 539 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 598 

Qy 171 170

Db 599 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPP 658 

Qy 171 170 

Db 659 YEEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 718 

Qy 171 170 

Db 719 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 778 

Qy 171 170 

Db 779 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 838 

Qy 171 170 

Db 839 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 898 

Qy 171 170

Db 899 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 958 

Qy 171 SSVVDLLYWRDIKKTG 186

: I I I I I I I I II i I I I I

Db 959 ATQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTG 1018

Qy 187 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 246

I I M I I I I I I I I I I I I I I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

Db 1019 VVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1078

Qy 247 SEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 306

I I I I I I II I I I I I I I I I I I I I I II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1

Db 1079 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1138

Qy 307 TLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
I I I I I M I I I I I : I I I I I I I I I I I I I I | | | | | : | | | | | | | | | | | | | | | | | | | ;
Db 1139 TLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192



RESULT 3 
RTN4_MOUSE 

ID RTN4_MOUSE STANDARD; PRT; 199 AA. 

AC Q99P72; Q9CTE3; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein).

GN RTN4 OR NOGO.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=3T3-L1; TISSUE=Adipocyte;

RA Coulson A.C., Craggs P.D., Morris N.J.;

RT "Mouse vp20/RTN4C cDNA.";

RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases.
RN [2] 

RP SEQUENCE OF 170-199 FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Embryo;

RX MEDLINE=21085660; PubMed=11217851;

RA Kawai J., Shinagawa A., Shibata K., Yoshino M., Itoh M., Ishii Y.,

RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S.,

RA Aizawa K., Izawa M., Nishi K., Kiyosawa H., Kondo S., Yamanaka I.,

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R.,

RA Kadota K., Matsuda H.A., Ashburner M., Batalov S., Casavant T.,

RA Fleischmann W., Gaasterland T., Gissi C., King B., Kochiwa H.,

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G., Quackenbush J.,

RA Schriml L.M., Staubli F., Suzuki R., Tomita M., Wagner L., Washio T.,

RA Sakai K., Okido T., Furuno M., Aono H., Baldarelli R., Barsh G.,

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F.,

RA Brownstein M.J., Bult C., Fletcher C., Fujita M., Gariboldi M.,

RA Gustincich S., Hill D., Hofmann M., Hume D.A., Kamiya M., Lee N.H.,

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P.,

RA Nordone P., Ring B., Ringwald M., Rodriguez I., Sakamoto N.,

RA Sasaki H., Sato K., Schoenbach C., Seya T., Shibata Y., Storch K.-F.,

RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C., Whittaker C., Wilming L.,

RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S.,

RA Hayashizaki Y.;

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By
CC similarity).

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the
CC membrane of the endoplasmic reticulum through 2 putative
CC transmembrane domains (By similarity).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=1;
CC Comment=A number of isoforms may be produced;
CC Name=1;
CC IsoId=Q99P72-1; Sequence=Displayed;
CC -!- SIMILARITY: Contains 1 reticulon domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AF326337; AAK08076.1;
DR EMBL; AK003859; -; NOT_ANNOTATED_CDS.
DR MGD; MGI:1915835; Rtn4.
DR GO; GO:0005783; C:endoplasmic reticulum; IDA.
DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; ISS.
DR GO; GO:0005635; C:nuclear membrane; ISS.
DR GO; GO:0005515; F:protein binding; ISS.
DR GO; GO:0019987; P:negative regulation of anti-apoptosis; ISS.
DR GO; GO:0030517; P:negative regulation of axon extension; ISS.
DR GO; GO:0007399; P:neurogenesis; IDA.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Endoplasmic reticulum; Alternative splicing; Transmembrane.



FT DOMAIN 1 25 CYTOPLASMIC (Potential).
FT TRANSMEM 26 50 POTENTIAL.
FT DOMAIN 51 137 LUMENAL (Potential).
FT TRANSMEM 138 162 POTENTIAL.
FT DOMAIN 163 199 CYTOPLASMIC (Potential).
FT DOMAIN 12 199 RETICULON.
SQ SEQUENCE 199 AA; 22466 MW; 07BE5D580059ED9C CRC64;



Query Match 50.1%; Score 914; DB 1; Length 199; 

Best Local Similarity 83.3%; Pred. No. 1.7e-24; 

Matches 190; Conservative 3; Mismatches 5; Indels 30; Gaps 2;



Qy 133 DDEPPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSVVDLLYWRDIKKTGVVFGAS 192
I I : I : i I I I I II I I I I I I I I I I I I M
Db 2 DDQ-----------------------------KKRWK-DKVVDLLYWRDIKKTGVVFGAS 31

Qy 193 LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAIS 252
I I M I I I I I II I I I I II I I I I I I I I I I II I I I I I I I I I I I I I || I | | | | | | | | | | | | M |
Db 32 LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAIS 91

Qy 253 EELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 312
I I M I II I I I I I I I I I I I I I I | | | | | | | | | | | | | | | M I I I II I I I I I I I I I I I I I I I I I
Db 92 EELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 151

Qy 313 LISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 360
I I I I I I I I I I II I I I I I I I I || | | I | I I | I I I | | | | | | | | | | | | || :
Db 152 LISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAE 199



RESULT 4 
RTN1_HUMAN



ID RTN1_HUMAN STANDARD; PRT; 776 AA. 

AC Q16799; Q16800; Q16801; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specific protein).

GN RTN1 OR NSP.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-A; RTN1-B AND RTN1-C).

RC TISSUE=Lung carcinoma;

RX MEDLINE=93293865; PubMed=7685762;

RA Roebroek A.J.M., Van de Velde H.J.K., Van Bokhoven A., Broers J.L.V., 

RA Ramaekers F.C.S., Van de Ven W.J.M.; 

RT "Cloning and expression of alternative transcripts of a novel 

RT neuroendocrine-specific gene and identification of its 135-kDa 

RT translational product."; 

RL J. Biol. Chem. 268:13439-13447(1993). 

RN [2] 

RP ALTERNATIVE SPLICING. 

RX MEDLINE=96429995; PubMed=8833145;

RA Roebroek A.J.M., Ayoubi T.A.Y., Van de Velde H.J.K., 

RA Schoenmakers E.F.P.M., Pauli I.G.L., Van de Ven W.J.M.; 

RT "Genomic organization of the human NSP gene, prototype of a novel gene 

RT family encoding reticulons.";

RL Genomics 32:191-199(1996). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX MEDLINE=98228245; PubMed=9560466;

RA Hens J., Nuydens R., Geerts H., Senden N.H., Van de Ven W.J.M.,

RA Roebroek A.J., van de Velde H.J.K., Ramaekers F.C., Broers J.L.;

RT "Neuronal differentiation is accompanied by NSP-C expression.";

RL Cell Tissue Res. 292:229-237(1998).

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane. 

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=3;

CC Name=RTN1-A; Synonyms=NSP-A;

CC IsoId=Q16799-1; Sequence=Displayed;

CC Name=RTN1-B; Synonyms=NSP-B;

CC IsoId=Q16799-2; Sequence=VSP_005644;

CC Name=RTN1-C; Synonyms=NSP-C;

CC IsoId=Q16799-3; Sequence=VSP_005645, VSP_005646;

CC -!- TISSUE SPECIFICITY: EXPRESSED IN NEURAL AND NEUROENDOCRINE TISSUES
CC AND CELL CULTURES DERIVED THEREFROM. EXPRESSION OF ISOFORM RTN1-C

CC IS STRONGLY CORRELATED WITH NEURONAL DIFFERENTIATION.

CC -!- PTM: Isoforms RTN1-A and RTN1-B are phosphorylated.

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; L10333; AAA59950.1; -. 

DR EMBL; L10334; AAA59951.1; 

DR EMBL; L10335; AAA59952.1; 

DR PIR; A46583; A46583. 

DR PIR; I60904; I60904.

DR Genew; HGNC:10467; RTN1.

DR MIM; 600865;

DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; TAS.

DR GO; GO:0004871; F:signal transducer activity; NAS.

DR GO; GO:0030182; P:neuron differentiation; TAS.



DR GO; GO:0007165; P:signal transduction; NAS.
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Endoplasmic reticulum; Alternative splicing; Transmembrane;
KW Phosphorylation.
FT TRANSMEM 603 623 POTENTIAL.
FT TRANSMEM 726 746 POTENTIAL.
FT DOMAIN 589 776 RETICULON.
FT DOMAIN 609 612 POLY-LEU.
FT VARSPLIC 1 420 Missing (in isoform RTN1-B).
FT /FTId=VSP_005644.
FT VARSPLIC 1 568 Missing (in isoform RTN1-C).
FT /FTId=VSP_005645.
FT VARSPLIC 569 588 GPGPLGPGAPPPLLFLNKQK -> MQATADSTKMDCVWSNW
FT KSQ (in isoform RTN1-C).
FT /FTId=VSP_005646.
SQ SEQUENCE 776 AA; 83617 MW; CA5B6232353096FE CRC64;



Query Match 45.3%; Score 826.3; DB 1; Length 776; 

Best Local Similarity 27.7%; Pred. No. 4.2e-20; 

Matches 195; Conservative 52; Mismatches 91; Indels 367; Gaps 17; 

Qy 10 VSSSTDS PPR 19 

Ml: I ||: 
Db 85 VSSAMDHTFSTTSKDGEGSCYTSLISDICYPPQEDSTYFTGILQKENGHVTISESPEELG 144 

Qy 20 -PPP AFKYQFVTEPED 34 

11 I : I I : I I I : 

Db 145 TPGPSLPDVPGIESRGLFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDITRPEE 204 

Qy 35 EEDEEEEEDEEED DED 50 

: :|: I II I I 

Db 205 VKHQEQHHPELEDKDLDFKNKDTDISIKPEGVREPDKPAPVEGKIIKDHLLEESTFAPYI 264 

Qy 51 LEELE 55 

I I : I 

Db 265 DDLSEEQRRAPQITTPVKITLTEIEPSVETTTQEKTPEKQDICLKPSPDTVPTVTVSEPE 324 



Qy 56 VLERKPAAGLS 66 

: I I I I I 

Db 325 DDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEAKGLSYETAENPRPVGQIADRP 384 

Qy 67 AAAVPPAAAAPLLDFSSDSV PPA 89 

I : I I : I I I : I I I : 

Db 385 EVKARSGPPTIPSP-LDHEASSAESGDSEIELVSEDPMAAEDALPSGYVSFGHVGGPPPS 443 

Qy 90 PRGPL PAAPPA 100 

II III 
Db 444 PASPSIQYSILREEREAELDSELIIESCDASSASEESPKREQDSPPMKPSALDAIREETG 503

Qy 101 "-APERQPS WERSPAAPAPSLPPA AAVLPSKLPEDD E 135 

INN : : Mill : I I I I I : I : 

Db 504 VRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGALEPETPMLPRK-PEEDSSSNQ 562 



Qy 136 PPARPPPPPPAGASPLAEPAAPPSTPAAPKRRGSGSSVVDLLYWRDIKKTGVVFGASLFL 195
N III MM I : : : I I I I M I I I : I I : I M : I I

Db 563 SPAATKGPGPLG PGAPPPLLFLNKQK AI DLLYWRDI KQTGIVFGS FLLL 611



Qy 196 LLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEEL 255

I Ml 11 = 111 Ihlll M I I I I I I I I I : I I : II : I I I II I : II M I : : I : I 
Db 612 LFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQ 671 

Qy 256 VQKYSNSAiGHWSTIKELRRLFLVDDLVDSL 315 

: I I I : : : I I I I : I II I II I I I I I I I I I I I I I I I I : I I II I I M I I I I I : : I : : I 
Db 672 IQKYTDCLQFYWSTLKELRRLFLVQDLVDSLKFAVI>IWLLTWGALFNGLTLLLMAVVS 731 

Qy 316 LFS I PVI YERHQVQI DHYLGLANKSVKDAMAKIQAKI PGLKRKAD 360 

• I :: I I : I : I I I I I I I I I : : I I I I I I I I I I I I : 

Db 732 MFT L P WYVKH QAQ I DQ YL G LVRT H I NAWAKI QAK I P GAKRHAE 776 



RESULT 5 
RTN1_RAT 

ID RTN1_RAT STANDARD; PRT; 777 AA. 

AC Q64548; Q64547; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specific protein) (S-rex).

GN RTN1 OR NSP.

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-B AND RTN1-S).

RC STRAIN=Wistar; TISSUE=Brain cortex;

RX MEDLINE=96386034; PubMed=8793864;

RA Baka I.D., Ninkina N.N., Pinon L.G.P., Adu J., Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmentalization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 
CC membrane trafficking in neuroendocrine cells.

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane (By

CC similarity).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=RTN1-B; Synonyms=S-RexB;

CC IsoId=Q64548-1; Sequence=Displayed;

CC Name=RTN1-S; Synonyms=S-RexS;

CC IsoId=Q64548-2; Sequence=VSP_005647, VSP_005648;

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN CENTRAL AND 

CC PERIPHERAL NERVOUS SYSTEM OF NEWBORN AND ADULT RATS. LOW LEVELS 

CC HAVE BEEN ALSO DETECTED IN HEART, ADRENAL GLAND AND SPLEEN. 

CC EXPRESSION OF ISOFORM RTN1-B IS RESTRICTED TO PARTICULAR NEURONAL 

CC TYPES. 

CC -!- DEVELOPMENTAL STAGE: DETECTED ON EMBRYONIC DAY E10 IN THE 

CC HINDBRAIN AND IN E11 IN THE FOREBRAIN. DURING THE NEXT 3 EMBRYONIC

CC DAYS THE LEVELS OF S-REXS INCREASES AND REMAINS STABLE AT E13 IN 

CC THE HINDBRAIN AND AT E14 IN THE FOREBRAIN. THE LEVELS OF S-REXB 

CC DOES NOT CHANGE AS SIGNIFICANTLY DURING DEVELOPMENT OF THE 

CC HINDBRAIN. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).



CC
DR EMBL; U17604; AAC53046.1;
DR EMBL; U17603; AAC53045.1;
DR InterPro; IPR003388; Reticulon.
DR Pfam; PF02453; Reticulon; 1.
DR PROSITE; PS50845; RETICULON; 1.
KW Endoplasmic reticulum; Alternative splicing; Transmembrane.
FT TRANSMEM 604 624 POTENTIAL.
FT TRANSMEM 727 747 POTENTIAL.
FT DOMAIN 590 777 RETICULON.
FT DOMAIN 610 613 POLY-LEU.
FT VARSPLIC 1 569 Missing (in isoform RTN1-S).
FT /FTId=VSP_005647.
FT VARSPLIC 570 589 IPGPLGSDLVPPLPFFNKQK -> MQATADSTKMDCVWSNW
FT KSQ (in isoform RTN1-S).
FT /FTId=VSP_005648.
SQ SEQUENCE 777 AA; 83001 MW; AF7479C50F28D0AC CRC64;



Query Match 44.3%; Score 807.4; DB 1; Length 777;

Best Local Similarity 27.9%; Pred. No. 1.8e-19;

Matches 189; Conservative 61; Mismatches 101; Indels 326; Gaps 15;

Qy 7 SSLVS SSTDSPPR--PPP 22



: I I : I :: :: I I | | 

Db 104 TSLISDICYPPREDSAYFTGILQKENGHITTSESPEELGTPGPSLPEVPGTEPHGLLSSD 163 

Qy 23 AFKYQFVTEPEDEEDEEEE EDEEED--DE 49

III :| I:: : :||: I I : : I |: 

Db 164 SGIEMTPAESTEVNKILADPLDQMKAEACKYIDITRPQEAKGQEEQSPGLEDKDLDFKDK 223 



Qy 50 DLE 52 

Db 224 DSEVSTKPEGVHAPNQPSPVEGKLIKDNLFEESTFAPYIDELSDEQHRMSLATTAPVKITL 283 

Qy 53 52 

Db 284 TEIGPPVMTATHETIPEKQDLCLKPSPDTVPTVTVSEPEDDSPGSVTPPSSGTEPSAAES 343 

Qy 53 52 

Db 344 QGKGSVSEDELIAAIKEAKGLSYETTESPRPVGQAADRPKVKARSGLPTIPSSLDQEASS 403

Qy 53 ELEVLERKPAAGLSA AAVPPAAAAPLLDFS 82

I : I : : II I : I I : I : I : : I 

Db 404 AESGDSEIELVSEDPMASEDALPSGYVSFGHVSGPPPSPASPSIQYSILREEREAELDSE 463 

Qy 83 SDSVPPAPRGPLPA AP P AAP ERQPSWERS PAAPAP S 118 

I I I I 111 : I I I I I : I I I 

Db 464 LIIESCDAS SASEESPKREQDS-PPMKPGVLDAI REETSSRATEERAPS-HQGPVEPDPI 521 

Qy 119 LPPAAAVLPSKLPEDDEPPARPPPPPPAG--ASPLAEPAAPPSTPAAPKRRGS-GSSVV- 174

I Ihll I I I I II: : I I I : I I I : I

Db 522 LSFTPVTLQSR-PEPSSGDGAPVPEPPKSQQQKPEEEAVSSSQSPAATEIPGPLGSDLVP 580

Qy 175 DLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFR 223

I I I I I I I I I : I I : I I I : I II III I I : I I I I I : I I I II I I I I I 

Db 581 PLPFFNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFR 640 

Qy 224 IYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDL 283 

III I : I I : II : I I I I I I : I I I I I : : I : I : I I I : : : I I I I : I I I I I I I I I I I 

Db 641 IYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDL 700 

Qy 284 VDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKD 343

I I I I II I I I I I : II I I I I I I I I I I I : : i : : I : I : : I I : I : I I I : I MM :

Db 701 VDSLKFAVLMWLLTWGALFNGLTLLLMAVVSMFTLPVVWKHQAQV 760

Qy 344 AMAKIQAKIPGLKRKAD 360

: I I I I I I I I I II I :
Db 761 WAKIQAKIPGAKRHAE 777



RESULT 6 
RTN3_MOUSE 

ID RTN3_MOUSE STANDARD; PRT; 237 AA.

AC Q9ES97; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 3. 

GN RTN3. 

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 



RA Huang X., Zhou Y., Qiang H., Yuan J., Qiang B.; 

RT "Cloning and expression profile of a novel mouse cDNA encoding a human 

RT RTN3 homolog.";

RL Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases.

RN [2]

RP SEQUENCE FROM N.A.

RC TISSUE=Eye;

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential) . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF195940; AAG31360.1; 

DR EMBL; BC014697; AAH14697.1; 

DR MGD; MGI:1339970; Rtn3.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 69 89 POTENTIAL.

FT TRANSMEM 167 187 POTENTIAL.

FT DOMAIN 49 237 RETICULON.

SQ SEQUENCE 237 AA; 25428 MW; EB60A0A7AC45F0DE CRC64;

Query Match 36.2%; Score 659.9; DB 1; Length 237; 

Best Local Similarity 49.8%; Pred. No. 9.6e-16; 

Matches 129; Conservative 47; Mismatches 52; Indels 31; Gaps 4; 



Qy 109 ERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLA EPAAPPSTPA 162 



•II I I ill- I 

SGAEPSALGGGGGSPGACPALGA 40

Qy 163 APKRRGSGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISF 222

I M : I I I I I I : I I I I I I I : I : I I I I I I : : II : I : II I II I I I II 
Db 41 --KSCGSSCAVHDLIFWRDVKKTGFVFGTTLIMLLSIAAFSVISWSYLIIALLSVTISF 98 

Qy 223 RIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDD 282

1:11 IMI:|I|:|IIII:MI: :: :| I I |:|: ||| :| : |||||:| 

Db 99 RVYKSVIQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHVNKALKLIIRLFLVED 158 

Qy 283 LVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVK 342

I I M I I II II: I I I I I : I I I : I I I I II : : I I : I : : I I : : : | | | | | : | : | | 
Db 159 LVDSLKLAVFMWLMTYVGAVFNGITLLILAELLVFSV^ 218 

Qy 343 DAMAKIQAKIPGL-KRKAD 360 

: I I I I I : I I : I : I I : 
Db 219 SIVEKIQAKLPGIAKKKAE 237 

RESULT 7 
RTN3_HUMAN



ID RTN3_HUMAN STANDARD; PRT; 236 AA. 

AC O95197;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 3 (Neuroendocrine-specific protein-like 2) (NSP-like

DE protein II) (NSPLII).

GN RTN3 OR NSPL2.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY.

RC TISSUE=Retina;

RX MEDLINE=99265974; PubMed=10331947;

RA Moreira E.F., Jaworski C.J., Rodriguez I.R.;

RT "Cloning of a novel member of the reticulon gene family (RTN3): gene

RT structure and chromosomal localization to 11q13.";

RL Genomics 58:73-81(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Du G., Yuan J., Qiang B.;

RT "Cloning and expression analysis of a cDNA encoding a novel

RT neuroendocrine-specific protein-like protein 1: NSPL1.";

RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain, Eye, and Lymph; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,



Ill : I I : : : 

Db 3 ESSAATQSPSVSSSS- 



RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length

RT human and mouse cDNA sequences.";

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential). 

CC -!- TISSUE SPECIFICITY: WIDELY EXPRESSED WITH HIGHEST EXPRESSION IN 
CC BRAIN. THREE TIMES MORE ABUNDANT IN MACULA THAN IN PERIPHERAL 

CC RETINA. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AF059524; AAC99319.1; -. 

DR EMBL; AF059529; AAD20951.1; -. 

DR EMBL; AF059525; AAD20951.1; JOINED. 

DR EMBL; AF059526; AAD20951.1; JOINED. 

DR EMBL; AF059527; AAD20951.1; JOINED. 

DR EMBL; AF059528; AAD20951.1; JOINED. 

DR EMBL; AF119297; AAD26810.1; -. 

DR EMBL; BC000634; AAH00634.1; -. 

DR EMBL; BC010556; AAH10556.1; -. 

DR EMBL; BC011394; AAH11394.1; -.

DR EMBL; BC022993; AAH22993.1; -. 

DR Genew; HGNC:10469; RTN3.

DR MIM; 604249; -.

DR GO; GO:0005615; C:extracellular space; TAS.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 68 88 POTENTIAL.

FT TRANSMEM 177 197 POTENTIAL.

FT DOMAIN 48 236 RETICULON.

SQ SEQUENCE 236 AA; 25609 MW; DDC6A4544ABCDFB7 CRC64;



Query Match 35.9%; Score 654.6; DB 1; Length 236;

Best Local Similarity 47.8%; Pred. No. 1.4e-15;

Matches 129; Conservative 44; Mismatches 57; Indels 40; Gaps 4;



Qy 96 AAPPAAPE RQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPL 151 

MM: I : I : I I III 
Db 2 AEPSAATQSHSISSSSFGAEPSAPG = GGGSPG 32 

Qy 152 AEPAAPPSTPAAPKRRGSGSSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI 211 

III I I : I I I II I : I I I I I I I : I : I I I I I I : : I I : I : 

Db 33 ACPA LGTKSCS SSCAVHDLI FWRDVKKTGFVFGTTLIMLLSLAAFSVI SWSYL 8 6 

Qy 212 ALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTI 271 

I I I I I I I I I I I I I I I I I I : I I I : I I I I I : I I I :::: I I I I : I : I : | : 

Db 87 ILALLSVTISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAMVHINRAL 146 

Qy 272 KELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLTLLILALISLFSIPVIYERHQVQID 331 

I : 11111:1111111 II II: I I I I I : I I I : I I I I I I : : | | : I : : I I : : : | | | 
Db 147 KLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAELLIFSVPIVYEKYKTQID 206 

Qy 332 HYLGLANKSVKDAMAKIQAKIPGL-KRKAD 360 

I I : I : I I : II I II : I I : 1 : I I : 

Db 207 HYVGIARDQTKSIVEKIQAKLPGIAKKKAE 236 



RESULT 8 
RTN2_HUMAN 

ID RTN2_HUMAN STANDARD; PRT; 545 AA. 

AC O75298; O60509;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specific protein-like 1) (NSP-like

DE protein 1) (NSPLI).

GN RTN2 OR NSPLI.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A., ALTERNATIVE SPLICING, AND TISSUE SPECIFICITY. 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=98360096; PubMed=9693037;

RA Roebroek A.J.M., Contreras B., Pauli I.G.L., Van de Ven W.J.M.;

RT "cDNA cloning, genomic organization, and expression of the human RTN2

RT gene, a member of a gene family encoding reticulons.";

RL Genomics 51:98-106(1998). 

RN [2] 

RP SEQUENCE OF 108-545 FROM N.A. (ISOFORM RTN2-B).

RC TISSUE=Brain;

RX MEDLINE=98191726; PubMed=9530622;

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 
CC reticulum (Potential).

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=RTN2-A;

CC IsoId=O75298-1; Sequence=Displayed;

CC Note=Isoform RTN2-C is produced by alternative initiation at

CC Met-341 of isoform RTN2-A;

CC Name=RTN2-B;

CC IsoId=O75298-2; Sequence=VSP_005649;

CC Event=Alternative initiation; 

CC Comment=2 isoforms, RTN2-A (shown here) and RTN2-C, are produced 

CC by alternative initiation at Met-1 and Met-341; 

CC -!- TISSUE SPECIFICITY: ISOFORM RTN2-C IS HIGHLY EXPRESSED IN SKELETAL 
CC MUSCLE. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF004222; AAC32542.1; -. 

DR EMBL; AF004223; AAC32543.1; -. 

DR EMBL; AF004224; AAC32544.1; -. 

DR EMBL; AF038540; AAC14910.1; -. 

DR Genew; HGNC:10468; RTN2.

DR MIM; 603183; -.

DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; NAS.

DR GO; GO:0004871; F:signal transducer activity; NAS.

DR GO; GO:0007165; P:signal transduction; NAS.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane; 

KW Alternative initiation.



FT CHAIN 1 545 RETICULON PROTEIN 2, ISOFORM RTN2

FT CHAIN 341 545 RETICULON PROTEIN 2, ISOFORM RTN2

FT INIT_MET 341 341 FOR ISOFORM RTN2-C.

FT TRANSMEM 368 388 POTENTIAL.

FT TRANSMEM 463 483 POTENTIAL.

FT DOMAIN 345 545 RETICULON.

FT VARSPLIC 272 344 Missing (in isoform RTN2-B).

FT /FTId=VSP_005649.

SQ SEQUENCE 545 AA; 59263 MW; 971FD2F909E1E9E6 CRC64;


Query Match 32.1%; Score 585.3; DB 1; Length 545;

Best Local Similarity 27.5%; Pred. No. 2.3e-12;

Matches 154; Conservative 52; Mismatches 117; Indels 237; Gaps 17;

Qy 8 SLVSSSTDSPPRPPPAFKYQFVTE-PEDEED EEEEEDEEED 47 

111:11 II I : I I II I I I I 

Db 16 STASSTPDS TEGGNDDSDFRELHTAREFSEEDEEETTSQDWGTPREL 62 

Qy 48 DEDLEELEVLERKPAAG 64

: I I : I : I I 

Db 63 TFSYIAFDGWGSGGRRDSTARRPRPQGRSVSEPRDQHPQPSLGDSLESIPSLSQSPEPG 122 



QY 



65 LSAAAVPPAAAAPLLDFSSDSVPPAPR- 

I : II: I 



GPL 94 
I I 



Db 



123 RRG- 



DPDTAPPSERPLEDLRLRLDHLGWVARGTGSGEDSSTSSSTPL 168 



Qy 


95 


Db 


169 


Qy 


130 


Db 


226 


Qy 


150 


Db 


286 


Qy 


174 


Db 


346 


Qy 


234 


Db 


406 


Qy 


294 


Db 


466 


Qy 


354 


Db 


526 



-QP S WERS PAAPAP S L PPAAAVLPSK — 129 

III II II II l|: 



-LPEDDE PPARPP PPPPAGAS 14 9 

I l::| III II: 



-PLAEPAAP PSTPA APKRRG SGSSV 173 

I : I : I Ml : I : I | | | 



I I I I I : I : : I I I I : Ml | M I II I :: I I II II I I : I : | : | | 



KSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLM 2 93 
: I : I I : I I I : : : : : I : : | : I I : | | II I : I I I I I II I : I 

RGDGANPFQAYLDVDLTLTREQTERLSHQITSRWSAATQLRHFFLVEDLVDSLKLALLF 4 65 



|:IM:|IIIMMI :| ll:||::| :|| Ml |:|| : |||:|| 



525 



-KRKAD 360 
I II: 



RESULT 9 
RTN2_MOUSE 

ID RTN2_MOUSE STANDARD; PRT; 471 AA. 

AC O70622; O70620;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specific protein-like 1) (NSP-like

DE protein 1) (NSPLI).

GN RTN2 OR NSPLI.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), AND TISSUE SPECIFICITY. 

RC STRAIN=FVB/N, and 129/Sv; TISSUE=Cerebellum, and Skeletal muscle;

RX MEDLINE=98191726; PubMed=9530622;

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC TISSUE=Retina; 

RX MEDLINE=22388257; PubMed=12477932;



RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Membrane-bound. Endoplasmic reticulum 

CC (Potential).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=1; Synonyms=Brain;

CC IsoId=O70622-1; Sequence=Displayed;

CC Name=2; Synonyms=Muscle;

CC IsoId=O70622-2; Sequence=VSP_005650, VSP_005651;

CC -!- TISSUE SPECIFICITY: Expressed predominantly in neural and muscular 
CC tissues. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF038537; AAC14906.1; -.

DR EMBL; AF038537; AAC14907.1; -. 

DR EMBL; AF038538; AAC14908.1; -. 

DR EMBL; AF038539; AAC14909.1; -. 

DR EMBL; AF093624; AAD13195.1; -. 

DR EMBL; BC031370; AAH31370.1; -.

DR MGD; MGI:107612; Rtn2.

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 295 315 POTENTIAL. 

FT DOMAIN 272 471 RETICULON.

FT VARSPLIC 1 267 Missing (in isoform 2).

FT /FTId=VSP_005650.

FT VARSPLIC 268 271 PLLL -> MGSK (in isoform 2).

FT /FTId=VSP_005651.



SQ SEQUENCE 471 AA; 51346 MW; 9BBD8F372CF63AD3 CRC64; 



Query Match 30.5%; Score 555.3; DB 1; Length 471; 

Best Local Similarity 27.0%; Pred. No. 1.7e-11;

Matches 140; Conservative 51; Mismatches 100; Indels 227; Gaps 13;

Qy 8 SLVSSSTDSPPRPPPAFKYQFVTEP EDEEDEEEEEDEEEDDEDLEE 53 

I M : I I II | | | | | | | :: | 

Db 16 STASSTPDS TEGGNDDSDFRELHTAREFSEDEEEETTSQDWGTPRE- 61 

Qy 54 LEVLERKPAAGLSAAAVPPAAAAPLLDFS SDSV — PPAPRGPLPAAP 98 

I I I III I I : I : I 

Db 62 LTFSYIAFDGWGSGGRRDSWRRPRPQGRSVSEP 96 

Qy 99 PAAPERQ PSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPAR — PPPPPPAG 147 

I : : I I : I I I I I I I I I I 

Db 97 RDPPQQSGLGDSLESIPSLSQSP E P GRRGD P D P VP PA- 133 

Qy 148 ASPLAE 153

I I I 

Db 134 ERPLEELRLRLDQLGWWRSAGSGEDSATSSSTPLENEEPDGLEASEAGEETNLELRLAQ 193 

Qy 154 PAAPPS TPAAPKRRGSGSS 172 

I II | | : : : | | 

Db 194 SLHLQLEVLTPQLSPSSGTPQAHTPSPQRSQDSNSGPDDEPLLNWEEHWRLLEQEPITA 253 

Qy 173 VVDLL YWRDI KKTGVVFGAS LFLLLSLTVFS I VSVTAYI ALA 214 

I I I I I I : I :: I I I : Ml I I I I II I : : I I 
Db 254 QCLDSTDQSEFMLEPLLLVADLLYWKDTRTSGAVFTGLMASLLCLLHFSIVSVAAHLALL 313 

Qy 215 LLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKEL 274 

I I I I I : I : I : I I : : I : I I : I I I : : : : : I : : | MM : I 

Db 314 GLCATISLRVYRKVLQAVHRGDGTNPFQAYLDMDLTLTREQTERLSQQIASHWSTATQL 373 

Qy 275 RRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQVQIDHYL 334 

I I I I : M II I I I 1:1 : : I M I I M M I I I : I I : : : II : M : M Ml III I : 

Db 374 RHFFLVEDLVDSLKLALLFYILTFVGAI FNGLTLVILGWALFTVPLLYRQHQAQIDQYV 433 

Qy 335 GLANKSVKDAMAKIQAKIPGL KRKAD 360 

II : II I M I I I I III: 

Db 434 GLVTNQLS HI KAKI RAKI P GT GT LAPTAS VS GS KAKAE 471 



RESULT 10 
PRGR_CHICK 

ID PRGR_CHICK STANDARD; PRT; 786 AA. 

AC P07812; Q90946; 

DT 01-AUG-1988 (Rel. 08, Created) 

DT 01-AUG-1988 (Rel. 08, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Progesterone receptor (PR).

GN PGR OR NR3C3.

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae;

OC Gallus.

OX NCBI_TaxID=9031;



RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=88166640; PubMed=3443098;

RA Gronemeyer H., Turcotte B., Quirin-Stricker C., Bocquel M.T.,

RA Meyer M.E., Krozowski Z., Jeltsch J.M., Lerouge T., Garnier J.M.,

RA Chambon P.;

RT "The chicken progesterone receptor: sequence, expression and 

RT functional analysis."; 

RL EMBO J. 6:3985-3994(1987). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91042592; PubMed=3153474;

RA Conneely O.M., Dobson A.D.W., Tsai M.-J., Beattie W.G., Toft D.O.,

RA Huckaby C.S., Zarucki T., Schrader W.T., O'Malley B.W.;

RT "Sequence and expression of a functional chicken progesterone 

RT receptor."; 

RL Mol. Endocrinol. 1:517-525(1987). 

RN [3] 

RP SEQUENCE OF 128-164 FROM N.A. 

RX MEDLINE=86289413; PubMed=2426779;

RA Conneely O.M., Sullivan W.P., Toft D.O., Birnbaumer M., Cook R.G.,

RA Maxwell B.L., Zarucki-Schulz T., Greene G.L., Schrader W.T.,

RA O'Malley B.W.;

RT "Molecular cloning of the chicken progesterone receptor.";

RL Science 233:767-770(1986).

RN [4] 

RP SEQUENCE OF 417-490 FROM N.A. 

RX MEDLINE=86287271; PubMed=2426697;

RA Jeltsch J.M., Krozowski Z., Quirin-Stricker C., Gronemeyer H.,

RA Simpson R.J., Garnier J.M., Krust A., Jacob F., Chambon P.;

RT "Cloning of the chicken progesterone receptor."; 

RL Proc. Natl. Acad. Sci. U.S.A. 83:5424-5428(1986). 

RN [5] 

RP DIFFERENCE BETWEEN FORM 1 AND FORM 2. 

RX MEDLINE=89340509; PubMed=2760059;

RA Conneely O.M., Kettelberger D.M., Tsai M.-J., Schrader W.T.,

RA O'Malley B.W.;

RT "The chicken progesterone receptor A and B isoforms are products of

RT an alternate translation initiation event.";

RL J. Biol. Chem. 264:14062-14064(1989).

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORMS A; A'; B AND B').

RX MEDLINE=90154085; PubMed=2303488;

RA Jeltsch J.-M., Turcotte B., Garnier J.-M., Lerouge T., Krozowski Z., 

RA Gronemeyer H., Chambon P.; 

RT "Characterization of multiple mRNAs originating from the chicken 

RT progesterone receptor gene. Evidence for a specific transcript 

RT encoding form A."; 

RL J. Biol. Chem. 265:3967-3974(1990). 

CC -!- FUNCTION: The steroid hormones and their receptors are involved in 
CC the regulation of eukaryotic gene expression and affect cellular 

CC proliferation and differentiation in target tissues. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=A; 

CC IsoId=P07812-1; Sequence=Displayed;

CC Name=A';

CC IsoId=P07812-2; Sequence=VSP_003708, VSP_003709;

CC Name=B;

CC IsoId=P07812-3; Sequence=VSP_003707;

CC Name=B';

CC IsoId=P07812-4; Sequence=VSP_003707, VSP_003708, VSP_003709;

CC -!- TISSUE SPECIFICITY: Oviduct. 

CC -!- DOMAIN: Composed of three domains: a modulating N-terminal domain, 
CC a DNA-binding domain and a C-terminal steroid-binding domain. 

CC -!- SIMILARITY: Belongs to the nuclear hormone receptor family. NR3 
CC subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; Y00092; CAA68282.1; -. 

DR EMBL; M13972; AAA49034.1; -. 

DR EMBL; M37518; AAA49013.1; -. 

DR EMBL; M37518; AAA49014.1; -. 

DR EMBL; M14278; AAA49035.1; -. 

DR EMBL; M14279; AAA49038.1; -. 

DR EMBL; M14280; AAA49039.1; -. 

DR EMBL; M32732; AAA49011.1; -. 

DR EMBL; M31104; AAA49011.1; JOINED. 

DR EMBL; M32726; AAA49011.1; JOINED. 

DR EMBL; M32727; AAA49011.1; JOINED. 

DR EMBL; M32728; AAA49011.1; JOINED. 

DR EMBL; M32729; AAA49011.1; JOINED. 

DR EMBL; M32730; AAA49011.1; JOINED.

DR EMBL; M32732; AAA49012.1; -. 

DR EMBL; M31104; AAA49012.1; JOINED. 

DR EMBL; M32726; AAA49012.1; JOINED. 

DR EMBL; M32727; AAA49012.1; JOINED. 

DR EMBL; M32728; AAA49012.1; JOINED. 

DR EMBL; M32729; AAA49012.1; JOINED. 

DR EMBL; M32730; AAA49012.1; JOINED. 

DR EMBL; M31104; AAA49009.1; -. 

DR EMBL; M31104; AAA49010.1; -.

DR PIR; A35466; A35466.

DR HSSP; P06401; 1A28.

DR TRANSFAC; T00698; -.

DR InterPro; IPR000536; Hormone_rec_lig.

DR InterPro; IPR000128; Progest_receptor.

DR InterPro; IPR001723; Stdhrmn_receptor.

DR InterPro; IPR008946; Str_ncl_receptor.

DR InterPro; IPR001628; Znf_C4steroid.

DR Pfam; PF00104; hormone_rec; 1.

DR Pfam; PF02161; Prog_receptor; 1.

DR Pfam; PF00105; zf-C4; 1.

DR PRINTS; PR00398; STRDHORMONER.

DR PRINTS; PR00047; STROIDFINGER.

DR ProDom; PD000035; Znf_C4steroid; 1.



DR SMART; SM00430; HOLI; 1. 

DR SMART; SM00399; ZnF_C4; 1. 

DR PROSITE; PS00031; NUCLEAR_RECEPTOR; 1. 

KW Receptor; Transcription regulation; DNA-binding; Nuclear protein; 

KW Zinc-finger; Steroid-binding; Alternative splicing. 



FT DOMAIN 1 420 MODULATING, PRO-RICH.

FT DNA_BIND 421 486 NUCLEAR RECEPTOR-TYPE.

FT ZN_FING 421 441 C4-TYPE.

FT ZN_FING 457 481 C4-TYPE.

FT DOMAIN 487 786

FT DOMAIN 48 80

FT VARSPLIC 1 127 Missing (in isoform B and isoform

FT /FTId=VSP_003707.

FT VARSPLIC 452 458 QHNYLCA -> TISYHCS (in isoform A' and

FT isoform B').

FT /FTId=VSP_003708.

FT VARSPLIC 459 786 Missing (in isoform A' and isoform

FT /FTId=VSP_003709.

FT CONFLICT 58 58 E -> DD (IN REF. 2).

FT CONFLICT 480 480 K -> N (IN REF. 2).

FT CONFLICT 489 489 G -> A (IN REF. 2).

FT CONFLICT 577 577 R -> T (IN REF. 2).

FT CONFLICT 642 642 M -> I (IN REF. 2).

SQ SEQUENCE 786 AA; 85743 MW; 659559950BC45ED9 CRC64;



Query Match 14.9%; Score 272.1; DB 1; Length 786; 

Best Local Similarity 15.9%; Pred. No. 0.2; 

Matches 121; Conservative 54; Mismatches 119; Indels 469; Gaps 28; 

Qy 11 SSSTDSPP RPPPA FKYQFVTEPEDEEDEEEEEDEEEDD 48

I I : I : I I : I :: I : I I : I I I I : I I I : 

Db 6 SKETRAPSSARDGAVLLQAPPSRGEAEGIDVALDGLLYPRSSDEEEEEEENEEEEEEEEP 65 

Qy 49 EDLEELEVLER KPAAG LSAAAVPPAAAAPLLDFSSD--SVPP 88

= 1111 : I I I I I | | | | : | | 

Db 66 QQREEEEEEEEEDRDCPSYRPGGGSLSKDCLDSVLDTFLAPAAHAAPWSLFGPEVPEVPV 12 5 

QY 89 AP — RGP L PAAP PAAP E RQ P S WE R S PAAP AP 117 

I I I I I : I I I II III | | 

Db 126 APMSRGPEQKAVDAGPGAPGPSQP R P GAP L W P GAD S LN VAVKARP G P E DAS EN RAP 181 

Qy 118 SLP PAAAVLPSKLPEDDEP 136 

M I I I I | : : || 
Db 182 GLPGAEERGFPERDAGPGEGGLAPAAAAS PAAV EPGAGQDYLHVPILPLNSAFLAS 237 



137 



136 



Db 



238 RTRQLLDVEAAYDGSAFGPRSSPSVPAADLAEYGYPPPDGKEGPFAYGEFQSALKIKEEG 297 



Qy 



137 PARPPP PPPAGASPLAE PAAP PST 160 



Db 



2 98 VGLPAAPPPFLGAKAAPADFAQPPRAGQEPSLECVLYKAEPPLLPGAYGPPAAPDSLPST 357 



Qy 



Db 



161 PAAP 164 

I I I 

358 SAAPPGLYSPLGLNGHHQALGFPAAVLKEGLPQLCPPYLGYVRPDTETSQSSQYSFESLP 417 



Qy 


165 


Db 


418 


Qy 


179 


Db 


478 


Qy 


239 


Db 


513 


Qy 


276 


Db 


566 


Qy 


309 


Db 


612 


Qy 


334 


Db 


672 



-KR RGSGSSWDLL Y Hi 



II: :: : : |:|: 

[FKKLN KMKWRTLDVALQQP 512 



-RLFLVDDLVDSLKFAVLMWFTYVGALFNGLTL 308 

I : | I : :::: | :: | 

fRNLHIDDQITLIQYS W MSL 611 



-ISLFSIPVIYERHQVQIDHY 333 
: I : I : I II : : 



- GLANKS VKDAM AKIQAKI PGLKRK 358 

II ::| I I : I I I : : I 



RESULT 11 
CAPU_DROME 

ID CAPU_DROME STANDARD; PRT; 1059 AA. 

AC Q24120; Q9VQV8; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Cappuccino protein. 

GN CAPU OR CG3399. 

OS Drosophila melanogaster (Fruit fly).

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota;

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;

OC Ephydroidea; Drosophilidae; Drosophila.

OX NCBI_TaxID=7227;

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=96033799; PubMed=7590229;

RA Emmons S., Phan H., Calley J., Chen W., James B., Manseau L.; 

RT "Cappuccino, a Drosophila maternal effect gene required for polarity 

RT of the egg and embryo, is related to the vertebrate limb deformity 

RT locus."; 

RL Genes Dev. 9:2482-2494(1995).

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RX MEDLINE=20196006; PubMed=10731132;

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,

RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X.,

RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D.,



RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G.,

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D.,

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M.,

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S.,

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P.,

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I.,

RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P.,

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M.,
RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P.,
RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W.,

RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K.,

RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,

RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J.,

RA Hostin D., Houston K.A., Rowland T.J., Wei M.-H., Ibegwam C.,

RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A.,

RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z.,

RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X.,

RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D.,

RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A.,

RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L.,

RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M.,

RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G.,

RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H.,

RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T.,

RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E.,

RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X.,

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J.,

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A.,

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L.,

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; 

RT "The genome sequence of Drosophila melanogaster.";

RL Science 287:2185-2195(2000). 

CC -!- SIMILARITY: Contains 1 Formin homology 1 (FH1) domain.

CC -!- SIMILARITY: Contains 1 Formin homology 2 (FH2) domain.

CC -!- SIMILARITY: Belongs to the formin homology family. Cappuccino
CC subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; U34258; AAC46925.1; -.

DR EMBL; AE003578; AAF51054.1; -. 

DR PIR; T13286; T13286. 

DR FlyBase; FBgn0000256; capu. 

DR GO; GO:0007304; P:eggshell formation (sensu Insecta); IMP.

DR GO; GO:0007316; P:pole plasm RNA localization; IMP.

DR InterPro; IPR000269; CuNH_oxidase.

DR InterPro; IPR003104; FH2.

DR InterPro; IPR001265; Formin.

DR Pfam; PF02181; FH2; 1.

DR PRINTS; PR00828; FORMIN.



DR SMART; SM00498; FH2; 1.

KW Developmental protein.

FT DOMAIN 480 560 FH1 (PRO-RICH).

FT DOMAIN 585 1021 FH2.

FT CONFLICT 260 260 S -> C (IN REF. 1).

FT CONFLICT 364 364 S -> T (IN REF. 1).

FT CONFLICT 386 386 T -> S (IN REF. 1).

FT CONFLICT 471 471 E -> K (IN REF. 1).

FT CONFLICT 495 495 H -> P (IN REF. 1).

FT CONFLICT 513 513 MISSING (IN REF. 1).

SQ SEQUENCE 1059 AA; 113863 MW; 009B0E24F61B6EA5 CRC64;


Query Match 14.2%; Score 259; DB 1; Length 1059;

Best Local Similarity 16.1%; Pred. No. 1.2;

Matches 113; Conservative 54; Mismatches 104; Indels 430; Gaps



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



27 
11 



4 IDQSSLVS 

: : : II I 

34 0 LE S AS LAS LGAGGVAGS LAT I AT AS TAS S DNQKTLQQI LKKRLLNCTTLAEVHAWNELL 399 



42 



12 SSTDSPPRPP PAFKYQF VTEPEDEEDEEEEE 

II I IN I II:: I : I : : | : | 

400 SSVDEPPRRPSKRCVNLTELLNASEATVYEYNKTGAEGCVKSFTDAETQTESEDCEGTCK 459 

43 DEEEDDEDLEELEVLERKPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGP 93 

M Ml Ml III III: I I I I 

460 CGQSSTKVSDNESAKEDGE KPHA VAPPPPPPPPPLHAFVAPPPPPPPPPP 509 

94 LP AAPPAAPERQPSWERSPAAPAPS LPPAAAVLPSK L 130 

I Ml I I :| I |: || : ||| | 

510 PPPPLANYGAPPPPPPPPPGSGSAPPPPPPAPIEGGGGIPPPPPPMSASPSKTTISPAPL 569 



131 PEDDE- 



-AGAS 14 9 



PP ARPPPPPP 

I : I M Mill : | : | 

570 PDPAEGNWFHRTNTMRKSAVNPPKPMRPLYWTRIVTSAPPAPRPPSVANSTDSTENSGSS 629 

150 PLAEPA AP P S T P AAP KRRGS GS S WDLL YWRD I KKT 185 

I M 111:111 I : I : : I 

630 P DE P PAAN GADAP PT AP PAT KE IWTEIEETPLDNIDEFTELFSRQAIAP 678 

186 GWFGASLFLLLSLTVFSIVSVTAYIALALLSVT 219 

I : : : : | : | 

679 VSKPKELKVKRAKSIKVLDPERSRNVGII WRSLHVPSSE 717 

22 0 ISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRL-- 277 

I H : : : : | : | | : | | | : : I I | : | : 

718 IEHAIY HIDTSV-VSLEALQHMSN IQATEDELQRIKE 753 



278 



-FLVD- 
I I : I 



-DLVD 285 



754 AAGGDIPLDHPEQFLLDISLISMASERISCIVFQAEFEESVTLLFRKLETVSQLSQQLIE 813 
286 S — 



~ L K FAVLMWV FT YVGAL FN G 

I I I:::: : |: || 
814 SEDLKLVFSIILTLGNYM NGGNRQRGQADGFNLDILGKLKDVKSKESHTTLLHFIV 869 



LTLL 309 

I I I 



: I 

Db 870 RTYIAQRRKEGVHPLEIRLPIPEPADVERAAQMDFEEVQQQIFDLNKKFLGCKRTTAKVL 929 

Qy 312 ALI SLFS I PVI YERHQVQI DHYLGLANKS VKDAMAKIQAKI 352 

I I I I I :::::: I : I I I | , : : 

Db 930 AA SRPEIMEPFKSKMEEFVEGADKS MAKLHQSL 962 



RESULT 12 
MLL4_HUMAN 

ID MLL4_HUMAN STANDARD; PRT; 2715 AA. 

AC Q9UMN6; O15022; O95836; Q96GP2; Q96IP3; Q9UK25; Q9Y668; Q9Y669;

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Myeloid/lymphoid or mixed-lineage leukemia protein 4 (Trithorax 

DE homolog 2).

GN MLL4 OR TRX2 OR HRX2 OR MLL2 OR KIAA0304.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1).

RA Angrand P.O., Valvatne H., Jeanmougin F., Adamson A.,

RA van der Hoeven F., Olsen L., Tekotte H., Huang N., Poch O.,

RA Lamerdin J., Chambon P., Losson R., Stewart A., Aasland R.;

RT "Mammalian trithorax- and ASH1-like proteins: putative chromatin

RT regulators which contain PHD fingers and SET domains.";

RL Submitted (JUN-1998) to the EMBL/GenBank/DDBJ databases.

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1).

RA Lamerdin J.E., McCready P.M., Adamson A.W., Burkhart-Schultz K.,

RA Garcia E., Kyle A., Ramirez M., Stilwagen S., Games J., Danganan L.,

RA Bruce R., Quan G., Montgomery M., Ow D., Kobayashi A., Olsen A.O.,

RA Carrano A.V.;

RT "Sequence analysis of a 1 Mb region in human 19q13.1.";

RL Submitted (NOV-1996) to the EMBL/GenBank/DDBJ databases.

RN [3] 

RP SEQUENCE OF 111-2715 FROM N.A. (ISOFORM 1).

RC TISSUE=Leukocyte, and Testis;

RX MEDLINE=20105772; PubMed=10637508;

RA Huntsman D.G., Chin S.-F., Muleris M., Batley S.J., Collins V.P.,

RA Wiedemann L.M., Aparicio S., Caldas C.;

RT "MLL2, the second human homolog of the Drosophila trithorax gene, maps

RT to 19q13.1 and is amplified in solid tumor cell lines.";

RL Oncogene 18:7975-7984(1999). 

RN [4] 

RP SEQUENCE OF 816-2715 FROM N.A. (ISOFORM 1).

RC TISSUE=Brain;

RX MEDLINE=97349984; PubMed=9205841;

RA Nagase T., Ishikawa K.-I., Nakajima D., Ohira M., Seki N.,

RA Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. VII. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro."; 

RL DNA Res. 4:141-150(1997). 



RN [5] 

RP SEQUENCE OF 1918-2715 FROM N.A. 

RC TISSUE=Brain, and Skin; 

RX MEDLINE=22388257; PubMed=12477932;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [6] 

RP PARTIAL SEQUENCE FROM N.A. (ISOFORMS 1 AND 2). 

RC TISSUE=Bone marrow, and Placenta; 

RX MEDLINE=99339983; PubMed=10409430;

RA FitzGerald K.T., Diaz M.O.; 

RT "MLL2: A new mammalian member of the trx/MLL family of genes."; 

RL Genomics 59:187-192(1999). 

CC -!- FUNCTION: Possibly acts as a transcriptional regulatory factor. 

CC -!- SUBCELLULAR LOCATION: Nuclear (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=1; Synonyms=Long;

CC IsoId=Q9UMN6-1; Sequence=Displayed;

CC Name=2; Synonyms=Truncated;

CC IsoId=Q9UMN6-2; Sequence=VSP_006668, VSP_006669;

CC -!- TISSUE SPECIFICITY: Widely expressed. Highest levels in testis. 
CC Also found in brain, bone marrow, heart, muscle, kidney, pancreas, 

CC spleen, thymus, prostate, ovary, intestine, colon, peripheral 

CC blood lymphocytes, and placenta. 

CC -!- DISEASE: Often amplified in pancreatic carcinomas. 

CC -!- SIMILARITY: Belongs to the TRX/MLL family. 

CC -!- SIMILARITY: Contains 1 bromodomain. 

CC -!- SIMILARITY: Contains 1 SET domain. 

CC -!- SIMILARITY: Contains 3 PHD-type zinc fingers. 

CC -!- SIMILARITY: Contains 1 CXXC-type zinc finger. 

CC -!- SIMILARITY: Contains 1 post-SET domain. 

CC -!- CAUTION: This protein was first named MLL2 by Ref.3 and Ref.6.

CC MLL2 corresponds to another protein located on chromosome 12 (see

CC AC O14686).

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; AJ007041; CAB45385.1; -.
DR EMBL; AD000671; -; NOT_ANNOTATED_CDS.
DR EMBL; AB002302; BAA20763.2; -.
DR EMBL; AF186605; AAD56420.1; -.
DR EMBL; AF104918; AAD17932.1; -.
DR EMBL; AF105279; AAD26113.1; -.
DR EMBL; BC009337; AAH09337.1; -.
DR EMBL; BC007353; AAH07353.1; -.
DR EMBL; AF105280; AAD26112.1; -.
DR MIM; 606834; -.
DR GO; GO:0005634; C:nucleus; NAS.
DR GO; GO:0003700; F:transcription factor activity; NAS.
DR GO; GO:0008270; F:zinc ion binding; NAS.
DR GO; GO:0048096; P:chromatin-mediated maintenance of transcrip...; NAS.
DR InterPro; IPR003889; FYrich_C.
DR InterPro; IPR003888; FYrich_N.
DR InterPro; IPR003616; PostSET.
DR InterPro; IPR001214; SET.
DR InterPro; IPR002857; Znf_CXXC.
DR InterPro; IPR001965; Znf_PHD.
DR Pfam; PF00628; PHD; 3.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF02008; zf-CXXC; 1.
DR SMART; SM00542; FYRC; 1.
DR SMART; SM00541; FYRN; 1.
DR SMART; SM00249; PHD; 4.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR PROSITE; PS50868; POST_SET;
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 3.
DR PROSITE; PS50016; ZF_PHD_2; 3.
KW DNA-binding; Bromodomain; Nuclear protein; Zinc-finger; Metal-binding;
KW Transcription regulation; Alternative splicing; Repeat.
FT DNA_BIND 37 44 A.T HOOK (BY SIMILARITY).
FT DNA_BIND 110 117 A.T HOOK (BY SIMILARITY).
FT DNA_BIND 357 365 A.T HOOK (BY SIMILARITY).
FT ZN_FING 959 1005 CXXC-TYPE.
FT ZN_FING 1201 1252 PHD-TYPE 1.
FT ZN_FING 1249 1303 PHD-TYPE 2.
FT ZN_FING 1335 1396 PHD-TYPE 3.
FT DOMAIN 1449 1471 BROMODOMAIN (DIVERGENT).
FT DOMAIN 2574 2695 SET.
FT DOMAIN 2699 2715 POST-SET.
FT DOMAIN 26 37 POLY-GLY.
FT DOMAIN 248 255 POLY-PRO.
FT DOMAIN 362 398 ASP/GLU-RICH (ACIDIC).
FT DOMAIN 402 771 PRO-RICH.
FT DOMAIN 808 812 POLY-GLN.
FT DOMAIN 1963 1970 POLY-PRO.
FT DOMAIN 2251 2259 POLY-PRO.
FT VARSPLIC 532 582 VSARSSRVIKTPRRFMDEDPPKPPKVEVSPVLRPPITTSPP
FT VPQEPAPVPS -> PLSQSLLLPMTLQLSLSLGQWAAPTTS
FT ACLDSPLWSPLLLRPRCPLTGLQL (in isoform 2).
FT /FTId=VSP_006668.
FT VARSPLIC 583 2715 Missing (in isoform 2).
FT /FTId=VSP_006669.
FT CONFLICT 834 834 K -> E (IN REF. 6).
FT CONFLICT 941 941 S -> Y (IN REF. 6).
FT CONFLICT 1317 1317 E -> Q (IN REF. 6).
FT CONFLICT 1362 1362 H -> Y (IN REF. 6).
FT CONFLICT 1438 1438 D -> N (IN REF. 6).
FT CONFLICT 1918 1920 PLA -> GTR (IN REF. 5; AAH09337).
FT CONFLICT 2541 2543 DEE -> ARG (IN REF. 5; AAH07353).
FT CONFLICT 2622 2622 D -> H (IN REF. 6).
SQ SEQUENCE 2715 AA; 293511 MW; C0615B981BBEB7BF CRC64;


Query Match 14.1%; Score 257.6; DB 1; Length 2715;
Best Local Similarity 12.6%; Pred. No. 12;
Matches 114; Conservative 30; Mismatches 100; Indels 664; Gaps 22;

Qy 2 EDIDQSSLVSS -STDSP- 17
11:1111: : I I
Db 130 EDVAPSSLRSALRSQRGRAPRGRGRKHKTTPLPPPRLADVAPTPPKTPARKRGEEGTERM 189

Qy 18 17

Db 190 VQALTELLRRAQAPQAPRSRACEPSTPRRSRGRPPGRPAGPCRRKQQAVWAEAAVTIPK 249

Qy 18 PRPPP AFKYQFVT 30
I I I I I : II :
Db 250 PEPPPPWPVKHQTGSWKCKEGPGPGPGTPRRGGQSSRGGRGGRGRGRGGGLPFVIKFVS 309

Qy 31 EPEDEEDEEEEEDE 44
: I : 1:1111 :
Db 310 RAKKVKMGQLSLGLESGQGQGQHEESWQDVPQRRVGSGQGGSPCWKKQEQKLDDEEEEKK 369

Qy 45 EEDDEDLEELEVLER KPAAGLSAAAVPPAAAAPLLDFSSDSVPPAPRGPLPAAP 98
I I : : : I I I I I III ! : I I II I I I I I I I
Db 370 EEEEKDKEGEEKEERAVAEEMMPAAEKEEAKLPP PPL TPPAPSPPPPLPP 419

Qy 99 PAA 101
I :
Db 420 PSTSPPPPLCPPPPPPVSPPPLPSPPPPPAQEEQEESPPPWPATCSRKRGRPPLTPSQR 479

Qy 102 101

Db 480 AEREAARAGPEGTSPPTPTPSTATGGPPEDSPTVAPKSTTFLKNIRQFIMPWSARSSRV 539

Qy 102 PERQPSWERSPA APAPSLPPAAAVLPS KLPE 132
I : I I I I I 1 I I I I I II III
Db 540 IKTPRRFMDEDPPKPPKVEVSPVLRPPITTSPPVPQEPAPVPS-PPRAPTPPSTPVPLPE 598

Qy 133 DDEPPARPPP 142
III III
Db 599 KRRSILREPTFRWTSLTRELPPPPPAPPPPPAPSPPPAPATSSRRPLLLRAPQFTPSEAH 658

Qy 143 PPPAGASPLAEPAAPPS--TPAAPKRRGSGSSVVDLLYWRDIKKTGVVFGA 191
I I I I I II II: : I I I : I I :
Db 659 LKIYESVLTPPPLGAPEAPEPEPPPADDSPAEPEPRAVGRT 699



Qy 192 SLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAI 251 

I I I I : I I I : I I I 
Db 700 NHLSLPRFAPWTTPVKA EVSPHGAP 725 

Qy 252 SEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNG-- 305

II II 

Db 726 ALSNGPQTQAQ 736 

Qy 306 LTLL 309 

: I I 

Db 737 LLQPLQALQTQLLPQALPPPQPQLQPPPSPQQMPPLEKARIAGVGSLPLSGVEEKMFSLL 796 

Qy 310 ILALISLFSI PVIYERHQVQ 329 

I : I I I Mil: 
Db 797 KRAKVQLFKIDQQQQQKVAASMPLSPGGQMEEVAGAVKQISDRGPVRSEDESVEAKRERP 856

Qy 330 IDHY LGLANKSVKDAMAKIQA 350

II III I : : : : | 

Db 857 SGPESPVQGPRIKHVCRHAAVTVLGQARAMVPEDVPRLSALPLRDRQDLATEDTSSASETE 916 

Qy 351 KIPGLKRK 358 

: I I : 

Db 917 SVPSRSRR 924 



RESULT 13 
FM14_MOUSE



ID FM14_MOUSE STANDARD; PRT; 1206 AA. 

AC Q05859; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-JUN-1994 (Rel. 29, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Formin 1 isoform IV (Limb deformity protein).

GN FMN OR LD.

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=92112033; PubMed=1339380;

RA Grusby-Jackson L., Kuo A., Leder P.;

RT "A variant limb deformity transcript expressed in the embryonic mouse 

RT limb defines a novel formin."; 

RL Genes Dev. 6:29-37(1992). 

CC -!- FUNCTION: Is important in the morphogenesis of limb and may have a 
CC function in differentiated cells or be involved in maintaining 

CC specific differentiated states. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=5; 

CC Comment=Additional isoforms seem to exist; 

CC Name=IV; 

CC IsoId=Q05859-1; Sequence=Displayed;

CC Name=IA;

CC IsoId=Q05860-1; Sequence=External;

CC Name=IB;

CC IsoId=Q05860-2; Sequence=External;

CC Name=II;

CC IsoId=Q05860-3; Sequence=External;

CC Name=III;

CC IsoId=Q05860-4; Sequence=External;

CC -!- TISSUE SPECIFICITY: It is found throughout the embryo but has a 
CC functional role only in the kidney and limb. 

CC -!- DEVELOPMENTAL STAGE: This is the isoform found in the apical 

CC ectodermal ridge and the mesenchymal compartment of the developing 

CC limb bud. 

CC -!- PTM: PHOSPHORYLATED ON SERINE AND POSSIBLY THREONINE RESIDUES. 

CC -!- SIMILARITY: Contains 1 Formin homology 1 (FH1) domain. 

CC -!- SIMILARITY: Contains 1 Formin homology 2 (FH2) domain.

CC -!- SIMILARITY: Belongs to the formin homology family. Cappuccino

CC subfamily.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; X62379; CAA44244.1; -. 

DR PIR; S24407; S24407. 

DR MGD; MGI:101815; Fmn.

DR InterPro; IPR003104; FH2.

DR InterPro; IPR001265; Formin.

DR Pfam; PF02181; FH2; 1.

DR PRINTS; PR00828; FORMIN. 

KW Nuclear protein; Developmental protein; Alternative splicing; 

KW Phosphorylation; Coiled coil. 

FT DOMAIN 418 443 COILED COIL (POTENTIAL).

FT DOMAIN 497 566 COILED COIL (POTENTIAL).

FT DOMAIN 644 744 FH1 (PRO-RICH).

FT DOMAIN 759 1164 FH2.

FT DOMAIN 1043 1116 COILED COIL (POTENTIAL).

FT DOMAIN 635 638 POLY-SER. 

FT DOMAIN 751 755 POLY-SER. 

SQ SEQUENCE 1206 AA; 133464 MW; 4DFB38CB52BD8EE7 CRC64; 



Query Match 14.1%; Score 257.5; DB 1; Length 1206; 

Best Local Similarity 14.1%; Pred. No. 1.8; 

Matches 95; Conservative 39; Mismatches 95; Indels 445; Gaps 22; 

Qy 13 STDSPPRPPP AFKYQFV 29 

III 1 Mil: 

Db 373 STDQESHKSPRDAHVQGGQVKARTPETALEAFKALFIRPPKKGSTADTSELEALKRKMKH 432

Qy 30 TEPEDEEDEEEEEDE EEDD 48 

: I I : : I : I : I I 
Db 433 EKESLRAVFERSKSRPADSPSDPKSPDQSPTEQDDRTPGRLQAVWPPPKTKDTEEKVGLK 492 



Qy 49 EDLEELEVLER- 59

i 1 I : I I :

Db 493 YTEAEYQAAILHLKREHKEEIETLQAQFELKTFHIRGEHALVTARLEEAIENLKQQLEKR 552

Qy 60 KPAAGLSAAA 69 

II I I 

Db 553 REGCEEMRDVCISTDDDCSPKAFRNVCIQTDRETFLKPCDAESKATRSSQIVPKKLTISL 612 

Qy 70 VPPAAAAPLLDFSSDSVPPAP 90

II II : I I I 

Db 613 TQLSPSKDSKDIHAPFQTREGTSSSSQQKISPPAPPTPPPLPPPL IPPPPPLP 665 

Qy 91 --RGPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPP 145

! I I I I I I I 1:11 II I : : I Ml I I I I I I 

Db 666 PGLGPLPPAPPIPPV CPVSPPPPPPPPP PTPVPPSDGPPPPPPPPPPLPN 715 

Qy 146 AGASP LAEPAAP PSTPAAPKRRG 168

: I I I I I I II I I 
Db 716 VLALPNSGGPPPPPPPPPPPPGLAPPPPPGLSFGLSSSSSQYPRKPAIEPSCPMKP 771 

Qy 169 SGSSVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 228

I I I 

Db 772 LYWTR 776 

Qy 229 IQAIQKSDEGHP FRAYLESEVAI SEELVQ 257

II I I : I I II I- I 

Db 777 IQINDKSQDAAPTLWDSLEEPHIRDTSEFEYLFSKDTTQQKKKPLSEAYEKKNKVKKIIK 836 

Qy 258 KYSNSA LGHVNSTIKELRR-LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLL 309

||: : : : : I : : : : : I I I I 
Db 837 LLDGKRSQTVGI LI SSLHLEMKDIQQAI FTVDD 869 

Qy 310 ILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ 349 

: : : | : : : I I I : : : I : I I : 

Db 870 --SWDLETLAALYE NRAQEDELTKIRKYYETSKEEDLKLLDKPEQF 914

Qy 350 AKIPGLKRKA 359 

I : I I : I 

Db 915 LHELAQIPNFAERA 928



RESULT 14 




MY15_HUMAN

ID MY15_HUMAN STANDARD; PRT; 3530 AA.

AC Q9UKN7;

DT 16-OCT-2001 (Rel. 40, Created)

DT 16-OCT-2001 (Rel. 40, Last sequence update)

DT 15-MAR-2004 (Rel. 43, Last annotation update)

DE Myosin XV (Unconventional myosin-15).

GN MYO15A OR MYO15.

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1]

RP SEQUENCE FROM N.A., SUBCELLULAR LOCATION, AND TISSUE SPECIFICITY.

RX MEDLINE=20021762; PubMed=10552926;

RA Liang Y., Wang A., Belyantseva I.A., Anderson D.W., Probst F.J.,



RA Barber T.D., Miller W., Touchman J.W., Jin L., Sullivan S.L.,

RA Sellers J.R., Camper S.A., Lloyd R.V., Kachar B., Friedman T.B.,

RA Fridell R.A.;

RT "Characterization of the human and mouse unconventional myosin XV 

RT genes responsible for hereditary deafness DFNB3 and shaker 2."; 

RL Genomics 61:243-258(1999). 

RN [2] 

RP PARTIAL SEQUENCE FROM N.A., AND VARIANTS DFNB3 TYR-2111 AND PHE-2113.

RX MEDLINE=98267311; PubMed=9603736;

RA Wang A., Liang Y., Fridell R.A., Probst F.J., Wilcox E.R.,

RA Touchman J.W., Morton C.C., Morell R.J., Noben-Trauth K., Camper S.A.,

RA Friedman T.B.;

RT "Association of unconventional myosin MYO15 mutations with human

RT nonsyndromic deafness DFNB3.";

RL Science 280:1447-1451(1998).

CC -!- FUNCTION: Myosins are actin-based motor molecules with ATPase 
CC activity. Unconventional myosins serve in intracellular movements. 

CC Their highly divergent tails are presumed to bind to membranous 

CC compartments, which would be moved relative to actin filaments (By 

CC similarity). May play a role in the formation or maintenance of

CC the actin-rich structures of the inner ear sensory hair cells. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- TISSUE SPECIFICITY: Highly expressed in pituitary. Also expressed 

CC at lower levels in adult brain, kidney, liver, lung, pancreas, 

CC placenta and skeletal muscle. Not expressed in brain. In the 

CC pituitary, highly expressed in anterior gland cells. 

CC -!- DISEASE: Defects in MYO15A are the cause of autosomal recessive

CC nonsyndromic deafness type 3 (DFNB3) [MIM:600316].

CC -!- SIMILARITY: Contains 1 myosin-like globular head domain. 

CC -!- SIMILARITY: Contains 1 FERM domain. 

CC -!- SIMILARITY: Contains 3 IQ domains. 

CC -!- SIMILARITY: Contains 1 MyTH4 domain. 

CC -!- SIMILARITY: Contains 1 SH3 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AF144094; AAF05903.1; -. 

DR EMBL; AF051976; -; NOT_ANNOTATED_CDS.

DR PIR; A59266; A59266.

DR HSSP; P10587; 1BR2.

DR Genew; HGNC:7594; MYO15A.

DR MIM; 602666; -.

DR MIM; 600316; -.

DR GO; GO:0007605; P:hearing; TAS.

DR InterPro; IPR000299; Band_4.1.

DR InterPro; IPR000048; IQ_region.

DR InterPro; IPR001609; myosin_head.

DR InterPro; IPR000857; MyTH4.

DR InterPro; IPR001452; SH3. 

DR Pfam; PF00612; IQ; 3. 

DR Pfam; PF00063; myosin_head; 1. 



DR Pfam; PF00784; MyTH4; 2.
DR PRINTS; PR00193; MYOSINHEAVY.
DR ProDom; PD000355; myosin_head; 1.
DR SMART; SM00295; B41; 1.
DR SMART; SM00015; IQ; 3.
DR SMART; SM00242; MYSc; 1.
DR SMART; SM00139; MyTH4; 2.
DR SMART; SM00326; SH3; 1.
DR PROSITE; PS00660; FERM_1; FALSE_NEG.
DR PROSITE; PS00661; FERM_2; FALSE_NEG.
DR PROSITE; PS50057; FERM_3; 1.
DR PROSITE; PS50096; IQ; 3.
DR PROSITE; PS50002; SH3; FALSE_NEG.
KW Myosin; ATP-binding; Actin-binding; Coiled coil; Repeat; SH3 domain;
KW Calmodulin-binding; Disease mutation; Deafness.
FT DOMAIN 1 1887 MYOSIN HEAD-LIKE.
FT DOMAIN 1888 2029 NECK OR REGULATORY DOMAIN.
FT DOMAIN 2030 3530 TAIL.
FT DOMAIN 1323 1350 COILED COIL (POTENTIAL).
FT DOMAIN 1792 1799 ACTIN-BINDING (POTENTIAL).
FT DOMAIN 1902 1924 IQ 1.
FT DOMAIN 1925 1954 IQ 2.
FT DOMAIN 1955 1976 IQ 3.
FT DOMAIN 2867 2953 SH3.
FT DOMAIN 3209 3530 FERM.
FT NP_BIND 1315 1322 ATP (POTENTIAL).
FT VARIANT 2111 2111 N -> Y (in DFNB3; family from Bengkala
FT /FTId=VAR_010303.
FT VARIANT 2113 2113 I -> F (in DFNB3; Indian family).
FT /FTId=VAR_010304.
SQ SEQUENCE 3530 AA; 395171 MW; 3D103923D4BCBE4A CRC64;



Query Match 14.1%; Score 256.4; DB 1; Length 3530;
Best Local Similarity 10.1%; Pred. No. 26;
Matches 145; Conservative 54; Mismatches 124; Indels 1106; Gaps 37;

Qy 17 PPRPP PAFKYQFV-TEPEDEEDEEE- 40
Ml | | | : | Mill:
Db 394 PPEVPYFYPEESASAFVYPWVPPPIPSPHNPYAHAMDDIAELEEPEDAGVERQGTSFRLP 453

Qy 41 EEDEE 45
: I I I
Db 454 SAAFFEQQGMDKPARSKLSLIRKFRLFPRPQVKLFGKEKLEVPLPPSLDIPLPLGDADEE 513

Qy 46 EDDEDLEELEVL 57
I I : I : I = :
Db 514 EDEEELPPVSAVPYGHPFWGFLTPRQRNLQRALSAFGAHRGLGFGPEFGRPVPRPATSLA 573

Qy 58 ERKPAAGL 65
I : I I I I
Db 574 RFLKKTLSEKKPIARLRGSQKTRAGGPAVREAAYKRFGYKLAGMDPEKPGTPIVLRRAQP 633

Qy 66 65

Db 634 RARSSNDARRPPAPQPAPRTLSHWSALLSPPVPPRPPSSGPPPAPPLSPALSGLPRPASP 693

Qy 66 S AAAVP PAAAA- 76

Db 694 YGSLRRHPPPWAAPAHVPPAPQASGWAFVEPPAVSPEVPPDLLAFPGPRPSFRGSRRRGA 753



Qy 77 PLLDFSSDSVPPAPR GPL--PAAPP 99

I I : I I I : I : II I II

Db 754 AFGFPGASPRASRRRAWSPLASPQPSLRSSPGLGYCSPLAPPSPQLSLRTGPFQPPFLPP 813

Qy 100 A APERQ PSWERSPAAPAPSLPP AAAVLPSKL 130

| | I | I I : I I II : : I I I : I 

Db 814 ARRPRSLQESPAPRRAAGRLGPPGSPLPGSPRPPSPPLGLCHSPRRSSLNLPSRLPHTWR 873 



Qy 131 PEDDE 135 

II I I 

Db 874 RLSEPPTRAVKPQVRLPFHRPPRAGAWRAPLEHRESPREPEDSETPWTVPPLAPSWDVDM 933 

Qy 136 -PPARPPPPPPAGA SPLAEPAAP 157 

I I I I I I I I I I : I I 

Db 934 PPTQRPPSPWPGGAGSRRGFSRPPPVPENPFLQLLGPVPSPTLQPEDPAADMTRVFLGRH 993

Qy 158 PSTPAAPK 165 

I II I II 

Db 994 HEPGPGQLTKSAGPTPEKPEEEATLGDPQLPAETKPPTPAPPKDVTPPKDITPPKDVLPE 1053 

Qy 166 RR 167 

Db 1054 QKTLRPSLSYPLAACDQTRATWPPWHRWGTLPQAAAPLAPIRAPEPLPKGGERRQAAPGR 1113 

Qy 168 167 

Db 1114 FAWMPRVQKLSSFQRVGPATLKPQVQPIQDPKPRACSLRWSCLWLRADAYGPWPRVHTH 1173 

Qy 168 GSGSS VVDLLYWRDI 182

| |:: I I: 1=

Db 1174 PQSCHLGPGAACLSLRGSWEEVGPPSWRNKMHSIRNLPSMRFREQHGEDGVEDMTQLEDL 1233

Qy 183 KKTGVVFGASLFLLLSLTVFSIVSVT AYIALALLSVT--ISFRIY 225

::| III:: I I hi I I I I 

Db 1234 QET TVLSNLKIRFERNLIYTYIGSILVSVNPYQMFGIYGPEQVQQY 1279 



Qy 226 K 226 

Db 1280 NGRALGENPPHLFAVANLAFAKMLDAKQNQCIIISGESGSGKTEATKLILRYLAAMNQKR 1339 

Qy 227 GVIQAIQ KSDEGHPFRAYLE S EVAI S EELVQK 258 

| : | | : : : I I : : I I h h : I 

Db 1340 EVMQQIKILEATPLLESFGNAKTVRNDNSSRFGKFVEIFLEGGVISGAITSQYLLEKSRI 1399 



Qy 259 YSNSA 263

I I 

Db 1400 VFQAKNERNYHIFYELLAGLPAQLRQAFSLQEAETYYYLNQGGNCEIAGKSDADDFRRLL 1459 

Qy 264 LGH 266 

I I : 

Db 1460 AAMEVLGFSSEDQDSIFRILASILHLGNVYFEKYETDAQEVASWSAREIQAVAELLQIS 1519 



Qy 267 VNSTIKELRRLF LVD--DLVDSLKFAVLM-WVFTYVGAL 302

| | : : | : : I II I : : : h I I : I I I I 



Db 1520 PEGLQKAITFKVTETMRE--KIFTPLTVESAVDARDAIAKVLYALLFSWLITRVNALVSP 1577 

Qy 303 FNGLTLLIL ALISLFSIPVI YERHQVQ 329 

II I : I I I : I III: 

Db 1578 RQDTLSIAILDIYGFEDLSFNSFEQLCINYANENLQYLFNKIVFQEEQEEYIREQIDWQE 1637

Qy 330 329 

Db 1638 ITFADNQPRINLISLKPYGILRILDDQCCFPQATDHTFLQKCHYHHGANPLYSKPKMPLP 1697 

Qy 330 IDHYLGLA 337 

I II I 

Db 1698 EFTIKHYAGKVTYQVHKFLDKNHDQVRQDVLDLFVRSRTRWAHLFSSHAPQAAPQRLGK 1757 

Qy 338 NKSVKDAMAKIQ AKIPGL 355

:|: I : I:: I II I 

Db 1758 SSSVTRLYKAHTVAAKFQQSLLDLVEKMERCNPLFMRCLKPNHKKEPGL 1806 



RESULT 15 






FMN1_MOUSE

ID FMN1_MOUSE STANDARD; PRT; 1468 AA.
AC Q05860;
DT 01-JUN-1994 (Rel. 29, Created)
DT 01-JUN-1994 (Rel. 29, Last sequence update)
DT 15-MAR-2004 (Rel. 43, Last annotation update)
DE Formin 1 isoforms I/II/III (Limb deformity protein).
GN FMN OR LD.
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=Kidney, and Testis;
RX MEDLINE=90363291; PubMed=2392150;
RA Woychik R.P., Maas R.L., Zeller R., Vogt T.F., Leder P.;
RT "'Formins': proteins deduced from the alternative transcripts of the
RT limb deformity gene.";
RL Nature 346:850-853(1990).
RN [2]
RP ALTERNATIVE SPLICING.
RX MEDLINE=97224459; PubMed=9119367;
RA Wang C.C., Chan D.C., Leder P.;
RT "The mouse formin (Fmn) gene: genomic structure, novel exons, and
RT genetic mapping.";
RL Genomics 39:303-311(1997).
RN [3]
RP PHOSPHORYLATION.
RX MEDLINE=93296176; PubMed=8516300;
RA Vogt T.F., Jackson-Grusby L., Rush J., Leder P.;
RT "Formins: phosphoprotein isoforms encoded by the mouse limb deformity
RT locus.";
RL Proc. Natl. Acad. Sci. U.S.A. 90:5554-5558(1993).
CC -!- FUNCTION: Is important for the morphogenesis of limb and kidney
CC and may have a function in differentiated cells or may be involved
CC in maintaining specific differentiated states.
CC -!- SUBCELLULAR LOCATION: Nuclear.
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=5;
CC Comment=Additional isoforms seem to exist;
CC Name=IA;
CC IsoId=Q05860-1; Sequence=Displayed;
CC Name=IB;
CC IsoId=Q05860-2; Sequence=VSP_001569;
CC Name=II;
CC IsoId=Q05860-3; Sequence=VSP_001570;
CC Name=III;
CC IsoId=Q05860-4; Sequence=VSP_001571, VSP_001572;
CC Name=IV;
CC IsoId=Q05859-1; Sequence=External;
CC -!- TISSUE SPECIFICITY: It is present in the adult kidney, testis,
CC limb, ovary, brain, small intestine, salivary gland and harderian
CC gland. It is present throughout the embryo.
CC -!- DEVELOPMENTAL STAGE: In the developing limb bud, the protein is
CC expressed in the apical ectodermal ridge and the mesenchymal
CC compartment, predominantly in the posterior region. During kidney
CC morphogenesis, expression is initially restricted to the
CC epithelial compartment of the pronephros and mesonephros.
CC -!- PTM: PHOSPHORYLATED ON SERINE AND POSSIBLY THREONINE RESIDUES.
CC -!- SIMILARITY: Contains 1 Formin homology 1 (FH1) domain.
CC -!- SIMILARITY: Contains 1 Formin homology 2 (FH2) domain.
CC -!- SIMILARITY: Belongs to the formin homology family. Cappuccino
CC subfamily.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; X53599; CAA37668.1; -.
DR PIR; S11515; S11515.
DR MGD; MGI:101815; Fmn.
DR InterPro; IPR003104; FH2.
DR InterPro; IPR001265; Formin.
DR Pfam; PF02181; FH2; 1.
DR PRINTS; PR00828; FORMIN.
DR SMART; SM00498; FH2; 1.
KW Nuclear protein; Developmental protein; Alternative splicing;
KW Phosphorylation; Coiled coil.
FT DOMAIN 723 792 COILED COIL (POTENTIAL).
FT DOMAIN 870 970 FH1 (PRO-RICH).
FT DOMAIN 985 1426 FH2.
FT DOMAIN 1305 1378 COILED COIL (POTENTIAL).
FT DOMAIN 198 203 POLY-SER.
FT DOMAIN 861 864 POLY-SER.
FT DOMAIN 885 892 POLY-PRO.
FT DOMAIN 911 925 POLY-PRO.
FT DOMAIN 929 940 POLY-PRO.
FT DOMAIN 951 962 POLY-PRO.
FT DOMAIN 966 970 POLY-PRO.



FT DOMAIN 977 981 POLY-SER.


FT




-L J i. 


1287 


Missing (in isoform IB) . 


FT








/FTId=VSP_001569.


FT 




DZ J 


177 


Missing (in isoform II).


FT 








/FTId=VSP_001570. 


FT VARSPLIC 626 627 IA -> SV (in isoform III).
FT /FTId=VSP_001571.
FT VARSPLIC 628 1468 Missing (in isoform III).
FT /FTId=VSP_001572.
SQ SEQUENCE 1468 AA; 163809 MW; EF2FB1E9CA9DAF43 CRC64;

Query Match 14.0%; Score 254.9; DB 1; Length 1468;
Best Local Similarity 14.5%; Pred. No. 3.5;
Matches 97; Conservative 40; Mismatches 103; Indels 431; Gaps 23;

Qy 2 EDIDQSSLVSSSTDSPPRP--PPA FKYQ 27

| : : | : I I : II I I I 

Db 602 EEASEKGLGPEKITAPPQHQLPPGIASEGFPCDNFKEQTAKDLPNKDGGVWVPGYRAGPP 661 

Qy 28 --FVTEPEDEEDEEEE EDEEEDD 48

I: I I: I hll 

Db 662 CPFLLHEEKEKTSRSELYLDLNPDQSPTEQDDRTPGRLQAWPPPKTKDTEEKVGLKYTE 721 



Qy 49 EDLEELEVLER 59
I I I : I I :
Db 722 AEYQAAILHLKREHKEEIETLQAQFELKTFHIRGEHALVTARLEEAIENLKQQLEKRREG 781

Qy 60 KPAAGLSAAA 69
II II
Db 782 CEEMRDVCISTDDDCSPKAFRNVCIQTDRETFLKPCDAESKATRSSQIVPKKLTISLTQL 841

Qy 70 VPPAAAAPLLDFSSDSVPPAP R 91
II II : M I
Db 842 SPSKDSKDIHAPFQTREGTSSSSQQKISPPAPPTPPPLPPPL IPPPPPLPPGL 894

Qy 92 GPLPAAPPAAPERQPSWERSPAAPAPSLPPAAAVLPSKLPEDDEPPARPPPPPP 145
I 1:11 II I: :l Ml MINI
Db 895 GPLPPAPPIPPV CPVSPPPPPPPPP PTPVPPSDGPPPPPPPPPPLPNVLA 944

Qy 146 --AGASP LAEPAAP PSTPAAPKRRGSGS 171
: | I II I I II I I
Db 945 LPNSGGPPPPPPPPPPPPGLAPPPPPGLSFGLSSSSSQYPRKPAIEPSCPMKP 997

Qy 172 SVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQA 231
III I"
Db 998 LYWTR I Q I 1005

Qy 232 IQKSDEGHP FRAYLESEVAISEELVQ 257
I | : I I II I I
Db 1006 NDKSQDAAPTLWDSLEEPHIRDTSEFEYLFSKDTTQQKKKPLSEAYEKKNKVKKIIKLLD 1065

Qy 258 -KYSNSA---LGHVNSTIKELRR-LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 312
||: : : : : | : : : : : I I I I :
Db 1066 GKRS QTVGI LI S S LHLEMKDI QQAI FTVDD S 1096

Qy 313 LI SLFS I PVI YERHQVQIDHYLGLANKSVKDAMAKIQ 349
:: | :: :|| |:: :| : 11=
Db 1097 WDLETLAALYE NRAQEDELTKIRKYYETSKEEDLKLLDKPEQFLHE 1143

Qy 350 -AKIPGLKRKA 359
I : I I : I
Db 1144 LAQIPNFAERA 1154



Search completed: September 29, 2004, 18:52:07 
Job time : 29.9792 secs



